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I ntroa uct I on 


The Office of University Affairs of the National Aeronautics 
ana Space Administration (NASA) has signed a grant establishing a 
Remote Sensing Information Sciences Research Group (ISRG) at the 
University of California, Santa Barbara (UCSB). This document 
represents a progress report of work conducted under this grant 
(Grant # NASA NAGW-45P) during the period May 1, 1986 to January, 
1987, and proposal for the year starting May 1, 1987. 

ISRG research continues to focus on Improving tne type, 
quantity, and quality of Information which can be derived from 
remotely sensed data. As we move Into tne coming year of our 
research, we will continue to focus on Information science 
research issues. In particular, we will focus on tne needs or 
the remote sensing research and application community which will 
be served by the Earth Observing System (EOS) and Space Station, 
including associated polar and co-orbltlng platforms. Research 
conducted under this grant has been used to extend and expand 
existing remote sensing research activities at UCSB In the areas 
of georererenced Information systems, machine assisted 
Information extraction from Image data, artificial Intelligence 
ana vegetation analysis and modeling. 

As world population increases, there Is an ever expanding 
need for systems and techniques capable of acquiring. 

Integrating, and analyzing Information concerning the extent, use 
of, and changes In the major components of the earth's surTace. 
NASA Is playing an Important role In the development of systems 
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such as EOS which have significant data acquisition capabilities. 
To achieve the full potential of such systems, however, requires 
that farsighted fundamental research be directed towards the 
scientific application of technologies upon which assessments may 
be made of both the current and changing status of the components 
of the biosphere, hydrosphere, I Ithosphere, and atmosphere. 

The program of research, documented in this progress report. 
Is being carried forward by personnel of the University of 
California, Santa Barbara. This report documents our 
accomplishments In what we consider to be a multiyear effort to 
prepare to take full advantage of the system's capabilities of 
the platforms and systems associated with Space Spation (e.g., 
EOS). Through this work, we have targeted fundamental research 
aimed at Improving our basic understanding of the role of 
Information systems technologies and artificial Intel I Igence 
techniques in the integration, manipulation and analysis of 
remotely sensed data for global scale studies. This coordinated 
research program is possible at UCSB due to a unique combination 
of researchers with experience In all these areas. 

Efforts during the early years of this grant focused on tne 
integration of existing research activities at UCSB and the 
initiation and conduct of a number of research activities with a 
variety of NASA centers. We have also worked on background 
assessments of research and technology, as well as beginning 
steps towards implementation of a Pilot Land Data System (PLDS) 
for NASA HeadquarTers. We continue to be Involved In PLDS 
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development efforts, largely through the Science Steering Group. 
In addition, UCSB personnel have been involved with: the EOS DaTa 
Systems Panel; Space Station Data User Working Group; Space 
Station Operations Task Force; Science and Applications 
Information Systems Working Group; Global Resources Information 
Systems; the United Nations Environment Programs Global Resources 
Information Database program; and, the Committee on Data 
Management and Computation (CODMAC) of the National Academy of 
Sc I ence . 

In addition, during this past year we have received funds 
from NASA Code El to supplement ISRG activities. These funds 
were proposed In September of 1985 to cover a range of tasks. We 
have also received a major grant from NASA Code EE/El to study 
problems associated with Browse In the EOS EkA and have submitTed 
a proposal to Code EE to assist In system acquisition Integration 
ana operation of the United Nations Environment Programs (UNEP) 
Global Resource Information Database (GRID), Nairobi, Kenya 
Fac I I i Ty . 

The material which fol lows detai Is ongoing work directly 
aided by this grant. Several of the projects used this funding 
as a catalyst to aid other NASA offices In the research. In the 
Integration of remotely sensed and other data Into an Information 
sciences framework. The following sections discuss the derails of 
the projects dealing with: 

* Studies Related to I SRG/ I nf ormat 1 on Science Office 
Research ; 

* Artificial Intelligence and Geographic Information 
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Systems, KBGIS- II: A Knowledge-Based Geographic 
Information System; 

* Requirements and Principals for the Implementation and 
Construction of Large Scale Geographic Information 
Systems; 

* NASA Pilot Data System Comparison; 

* BROWSE of remotely sensed data In the 1990's; 

* Na I rob I Propose I ; 

* Band Ratios and Band Selection tor Mu 1 1 1 spectra I Image 
Generation; 

* A Glossary of Data System Terms; 

* An Agricultural Inventory Data Base for the Region of 
the Veneto, Italy; and, 

* University Research Libraries Group, Geographic 
Information System for Scholars. 

These projects are discussed In some detail In the following 
sections (and in associated Appendices) in terms of both ongoing 
and proposed activities for the coming year. This Progress 
report concludes with a section outlining the budget for this 
coming year's activities. The appendlcles which follow contains 
material expanding upon work from various sections of tne report; 
publications which have been funded (In whole or part) by this 
grant; committees memberships held by our staff with relevance to 
Information sciences; and, symposia and professional society 
papers are presented, related to work on this grant. 
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John E. Estes and Jeffrey L. Star 

In 1985, UCSB ISRG proposed sTudles to directly support tne efforts 
of NASA's Information Systems Office, Code El. Our proposal discussed a 
number disclpl Ine areas which continue to be relevant to Code El, and we 
have been In close contact with Code El staff to determine which of these 
tasK areas provide the greatest return to NASA and tne Information 
Systems Office. In the following text, we describe the different task 
areas and tne work accomplished In each area. 

The UCSB Information Science Group suggests the fol lowing topics for 
special studies for Code El: 

1. The interaction between the Code El daTa pliOTS and tne 
proposed EOS data system, and Impacts on the university 
earth science remote sensing community; 

2. University workstation activities and requirements as 
they relate to NASA Center and NASA Land and Ocean Data 
PI lots; 

3. University roles In the planning and development of tne 
OSSA Science and Applications Information System; 

4. The impact of the imposition of uniform NASA data 
standards on university research; 

5. University roles In the development of a Global 
Resources Information System; 

6. Artificial Intelligence and the Image Understanding Process; 

and, 

7. A Workshop on University Information Systems Requirements. 

A good deal of planning, research, and operational effort has 
occurred tnrough the Pilot Land, Ocean and Climate Data Systems, and 
somewhat independently, on the Earth Observing Systems data system. 
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Panels (with In some cases overlapping memberships) have convened on a 
number of occasions. 

However, there nas been little examination of the Interactions 
between pilots and the EOS data system, nor what such Interactions may 
Imply tor the academic Earth Sciences Community Involved In remote 
sensing research. Questions will Involve hardware, software, and 
communications compatibility Issues. Links to NASA Centers and to other 
national (plus state, and local) as wel I as International data bases will 
also De Involved. Universities will be an important part of the user 
community for the EOS and Pilot systems. However, what are the key 
Issues involved In facilitating the use of data from tnese systems by 
university scientists? What needs to be accomplished to Insure that this 
Important segment of the university community can participate to tne 
ful lest extent In the research opportunities afforded by these systems? 

NASA, through Code El, Information Systems Office (ISO) has conducted 
studies over the past decade which have lead to the establishment of a 
number of pilot data systems. These pilot data systems are oriented 
towards the science driven data/ l nformatlon needs of users In defined 
disciplines under the NASA research umbrel la. To daTe, pliOT systems have 
been undertaken in the planetary, climate, oceans and land sciences areas. 

The goal of these systems Is to provide a variety of types of 
support for NASA and NASA related researchers. This support Includes the 
maintenance of directories and catalogues, the formatting and 
preprocessing of data and the facilitation of communications between 
discipline scientists conducting NASA research. 


NASA Is also planning for Earth Observing Satellite System (EOS) 
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EOS is designed to aid scientists in addressing the multidisclpl Inary 
earth science problems. The system must provide relevant information tor 
detailed local studies, as well as for investigations aimed at an 
improved understanding of global dynamics. Recognizing the importance ot 
such a system to scientific understanding, the EOS Science and Mission 
Requirements Working Group recommended that NASA should develop tne Earth 
Observing System as an information system mission. 

University researchers will make up an Important element of the user 
community for Information from both the Data Pilots and EOS. Yet, in the 
studies conducted to date, little attention has been focused on eitner 
the specific needs or the generic problems involved in achieving optimum 
university participation in such systems. We propose to conduct such a 
study. 


Specif leal ly, we have examined the operation of current NASA plioT 
systems as they pertain to Interactions with university researchers. 

A prei iminary version of this material is presented in a fo I lowing 
section of this report. The report discusses the similarities and 
differences in the various pilots, it also discusses now the piioTS have 
been organized in support of their diverse user communities and the level 
of satisfaction on the part of both "users" and "operators" of the plioT 
systems. 

Our work in this area is continuing. In the coming months and InTo 
next year we wil I broaden somewhat the scope of this activity to examine 
what reatures of these pilots support the EOS Data System Concept as 
outlined in the Reports of the EOS Data Panel. The study will also look 
Into how these existing pilots might be more ettectlvely I inxed to tne 
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broader NASA sponsored University Community. 


5 


i 


Research Act! vines for the Information Systems Office 


Science and Applications Information System 

Another area of Increased Interest Is large scale Information 
systems. The Science and Applications Information System planning effort 
Is currently being conducted within NASA Code E. This system, as 
currently envisioned. Is being planned to answer the science and 
applications information systems requirements of NASA and NASA associated 
researchers working under Code E funding. This wide ranging effort Is 
critical for effective and efficient use of EOS and other advanced 
systems being developed for use In the Space Station time frame of the 
1990's and beyond. 

We proposed to provide a university perspective to tne SAIS planning 
process. We feel this Is an Important area where study and Interaction 
witn tne University community Is required. This has Included working 
closely with NASA Headquarters and center personnel who are actively 
involved In the planning process. Questions regarding links to 
universities, transfer of data, data archiving, value added services, and 
standards are all of critical concern. The role of existing pi lots and 
university beta test sites also should be addressed. For example, should 
SAIS planning be scenario driven? Expandability, heterogenous networks, 
and system evolution are al I Important concepts to the SAIS. As a result 
of our efforts in this area. Dr. Estes and Dr. Star are beginning to work 
closely on SAIS activities with NASA and other university personnel. 
Indeed, a major SAIS working meeting will be held the week of February 9, 
1987 at UCSB. At this meeting, a cross-section of NASA and NASA- 
sponsored scientists wil l meet to be brought up-to-date on me Space 
Station program, and to focus on refining the structure of an SAIS 
system. 
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We at the ISRG will continue to support SAIS activities through 
reviews of project plans and attendance at SAIS science and working group 
meetings. In addition, our activities In the development of an Image 
processing test Ded will directly support our work, testing the SAIS 
concept for Telescience use of Space Station polar platforms In the EOS 
era. 

Data Standards 

Data standards have been discussed by many NASA and other federal 
agencies as well as science community committees for a considerable 
length of time. The imposition of standards Implies that some change In 
operational procedures will be required to Insure compatab 1 1 I ty. What 
impact will changes in data structures or headers have on university 
research and operations? To some extent, this can be examined within the 
context of changes which have occurred In Landsat data formats. The 
problem here, however, ts much broader and the Issues and impacts upon 
the university community need to be addressed. 

We have been working with Jet Propulsion Laboratory personnel on 
data standards for the university earth science community. In this 
study, we are trying to determine both the advantages and potential 
disadvantages of establshing standards. The orientation of the study 
will be to consider a minimum set of standards which wi I I Improve and not 
impede universities' abilities to employ NASA acquired remote sensor 
Information In an effective and efficient manner. 

The study has examined one proposed data format family, based on a 
data description language. We have written software for both the IBM 
PC/AT and the VAX under Unix, to test the efficiency as wel I as tne 
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logical structure of the proposed data standard. This part of the effort 
will continue at a lower level In the second halt of tne current contract 
year, as the standard and our software undergo review. Current practices 
ana procedures tor acquisition, processing, analysis, reporting, ana 
storing of Information will be reviewed. Areas where standards can be 
employed to improve this process will be discussed, including 
communication between sites and the development of geographic Information 
systems. We wil I also discuss the role of standardized software 
Interfaces. Emphasis will be placed on determining a minimum set of 
standard formatting and processing procedures which wil l provide the 
greatest benefit to the university community. 

Global Resources Information System 

In recent years, NASA has been moving towards conducting global 
scale studies with time durations on the order of decades. Study 
documents in tne rorm of program plans have been developed for land- 
related global habitability studies and for global biology studies. In 
addition, the Space Station and its associated sensor plaTforms are being 
justified In part on the basis of long term research studies on topics of 
global significance and concern. The Earth Observing System (EOS) 

Science Steering Group document clearly points out the need for such 
studies. This same document also poinrs out the need to consider EOS as 
an Information system. 

ISO, at the urging of congress, has been developing the concepT of a 
Global Resources Information System (GRIS) for the past several years. 
These studies have been lead by researchers aT tne Jet Propulsion 
Laboratories. There Is clearly a need for such systems. That need 
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extends from the use of such data as might be contained In such a system 
In a policy making context. In a commercial context, and In a research 
and development context. While the two former uses (policy and 
commercial) are important, a case can be made that tne research and 
development aspects of the data In such a system would be extremely 
significant. The question asked then is, who will be the real 
beneficiaries of such a system? It Is clear that among the top 
beneficiaries will be university researchers. As one of these, we 
propose a study of how universities can contribute to the 
Research Activities tor the Information Systems Office development of a 
Global Resources Information System. 

A strong case can be made that some of the greatest beneficiaries of 
a Global Resources Information System (GRIS) will be found In the 
university research community. Policy makers and federal agency 
personnel will make extensive use of these data, but It wll I be 
university researchers, doing advanced work In analysis and model ling, 
who stand to reap significant near-term benefits from the global harvest 
of data wnich can be provided by such a system. As such. It Is Important 
to understand the roles which can be played by universities In the 
development of a GRIS-type system and to address the Issue of university 
participation In the development and operation of Global Scientific 
Information Systems. 

As such, UCSB personnel have visited and will continue to work 
closely with personnel of the UNEP/GRID program In Geneva, Switzerland, 
as we I I as with personnel of the World Bank and NASA on Support for 
Global Science and Applications research. An outgrowth of this work was 
the placing of an Intern at the World Bank this past year. Our student 
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worked toward his Master's Degree, and was responsible for operating an 
Image processing system at the World Bank Headquarters in Washington, 

D.C. Our student, Mr, Paul Lefebvre, participated In training of Bank 
employees as wel I as participating directly in tne development of 
materials for Bank projects In developing nations. This was part of an 
effort at me World Bank, evaluating the utility of remoTely sensed daTa, 
as well as developing some In-house experience with the Information 
systems requirements. At the present time, we are In the process of 
negotiating another Intern for the World Bank. Bank personnel have 
expressed considerable Interest In this. We hope our eftorTs In tnis 
area are successful. 

In addition, we at UCSB working with personnel from Code EE and tne 
National Science and Technology Laboratory at Bay St. Louis, Mississippi, 
have proposed an effort to support the upgrading of computed facilities 
and the UNEP/GRID facility In Nairobi, Kenya. The proposal for this 
effort is contained In an appendix to this report. We remain InTerested 
in and are actively pursuing Improved Insights Into the Institutions, 
techniques and methods currently available for tne conduct of large scale 
and global science. We feel our activities In this area are progressing 
well. This work will continue InTO The coming year tnrough our proposal 
efforts. In addition, we wil I continue to examine ways to Improve 
science and applications data linkages on the i nTernati ona I level. This 
work Is Important If we are to move toward the goals laid out by the 
Earth Science Steering Committee In the Earth System Science Report. 

Artificial Intel I Igence 

Image analysis Is a process which Is integral to al I remote sensing 
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systems. Despite advances In computer-assisted Image processing, the 
Information extraction potential Inherent In digital image processing 
lags behind the level of Information extraction exhibited by manual Image 
Interpretation Techniques. An examination of the processes involved In 
computer-assisted versus manual Image Interpretation reveals that 
computer assisted techniques rely largely on a single element, 
specif teal ly tone or color, and a primitive form of texture; whereas 
manual interpretation Techniques Incorporate much higher level Inputs 
(e.g., context). 

Since concepts from the field of artificial intelligence (Al) have 
proven useful In applications similar to those found in the 
Interpretation ana analysts of remotely sensed images, UCSB has been 
conducting research for several years to identify specific research 
directions in which Al techniques can be applied to remote sensing with 
potential high returns. Much of this work has been driven by Prof. T. 
Smitn, co-investigator on this grant. This leads us to propose a STudy 
In which some of the fundamental research required in the development of 
a computer assisted interpretation system is conducted. 


Since the data inputs to an image Interpretation system are by 
necessity varied and complex, a reasoning model must first be 
developed that incorporates the various data forms and models their 
Interactions and preferred order of occurrence. This Involves tne 
assemb I ance and structuring of the knowledge of expert image 
Interpreters, and the formation of information STruchTures which 
allow efficient storage and access to the varied data forms. The 
data may be digital (Image or other col lateral data) or semantic In 
nature, and may concern either the appl I cations area or background 
information concerning the region to be analysed. 

In order to optimize the efficient search through a large and 
complex knowledge base, a system must have some way of prioritizing 
the processing Instructions. One way to develop this prioritization 
Is by conduct long a case STudy of the computer assisted processing 
and analysis of land use/ land cover classification studies, in order 
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to develop reasonable a priori probabilities for the Initial 
program. These a priori probabilities could then be Incorporated 
Into the decision model to Indicate the preferred processing flow. 

Lisp is often cited as the language of choice In an A I program. 
However, In a digital Image processing context, where simply an 
excess of Input/output initiations can produce an Increase In the 
cost of running an Image processing program by an order of 
magnitude. Lisp may not be the most appropriate programming 
language. A study is needed that wil I examine the unique problems 
Involved in the data structuring, data Interaction ana digital 
processing to determine which programming language would be optimal 
for that data or instructional module. The languages which would be 
examined for this study would Include Lisp, Prolog, C and Fortran. 

It is anticipated that different modules would be optimal ly 
represented In different languages. This could be followed by a 
cost/benefiT analysis to Integrate the development and operational 
considerations Involved In a computer-assisted analysis system in 
order to reveal the programming direction to task In tne design of 
such a system. 


Conclusion 

In conclusion, we believe that our work in this area represents a 
particularly efficient mechanism for the UCSB ISRG to support NASA»s Code 
ES. With the decision on which of the technical and scientific tasks are 
most Important remaining with Code El staff, we are able to respond to 
changing needs and requirements at Headquarters on a timely ana efficient 
basis. We look forward to this relationship, and to the continuing 
opportunity of supplying a university perspective to Code El. 
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NASA has funded pilot data systems In the ocean, climate, 
planetary and land sciences disciplines. The data pIiots are 
designed to examine the acquisition, storage, analysis, and use 
of remotely sensed data in the different disciplines. To date, 
there has been little effort spent to examine the Important 
features of each of these test efforts, to compare ana contrast 
the different approaches taken, and to consider the commonalities 
ana alfferences between them. 

The fol lowing chapter is a prel i mi nary attempt at Just such 
a comparison. As part of our efforts In 1987, we propose to 
rerine this document, by examining the successes of each of tne 
pilot programs, and Identifying the areas where additional work 
could be beneficial to NASA’s new efforts in tne information 
sciences. 

It Is our feeling that such work Is critical If we are to 
use the existing pilots as the foundation, the building blocks, 
upon wnich the EOS data system Is to be developed, as some have 
suggested . 
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COMPARISON OF PILOT DATA SYSTEMS 


1* Background 

The National Aeronautics and Space Administration (NASA) has developed four infor- 
mation systems to serve as prototypes for demonstrating principles, techniques, systems, and 
capabilities for the assembling, storing, management, and dissemination of discipline-specific 
remotely sensed data sets. One of these pilot systems, the Pilot Ocean Data System (PODS) 
has been implemented by NASA as NODS, the NASA Ocean Data System. The Pilot Cli- 
mate Data System (PCDS), while also considered operational, still carries the term ‘pilot’ in 
its title. Different codes within NASA have lead responsibility for each system. To date, a 
thorough comparison of important features of the four data systems has not been made. This 
paper addresses that situation by comparing system components. Hopefully, the systems will 
benefit from this comparison by drawing on each others’ experience. 

Each data system supports major discipline areas within NASA— lands, oceans, climate, 
and planets. As stated above, NODS and PCDS are currently operational. The Pilot Land 
Data System (PLDS) and the Planetary Data System (PDS) will have preliminary versions 
operating in 1987. The objective of these pilot projects is to make the large (but widely dis- 
tributed) collections of satellite and ancillary observations of interest to NASA and NASA- 
related discipline scientists more readily accessible to researchers who may not even know 
what data are available or how to get hold of them. Future data systems such as the Science 


University of California, Santa Barbara 


- 2 - 


Draft~l/87 


and Applications Information System (SAIS) and the Earth Observations System (EOS) will 
integrate the pilot systems and expand upon them. 

The need for improved management of satellite and in-situ data sets has emerged from 
many related factors. The public has made enormous investments in space-related research. 
Large volumes of data accumulated represent a national treasure that cannot be replaced and 
should be preserved. Concurrently, the potential scientific value of the data means these data 
sets should be accessible to researchers. Typically, however, once the active phase of the 
research satellite mission has ended, the resources available to manage the data diminishes. 
Data is then archived in scattered locations, in non-uniform formats, with undocumented pro- 
cessing, and therefore is not adequate for use. It was in recognition of this potential loss of 
priceless data, combined with the anticipation of more data from the Earth Observing System 
on Space Station in the 1990*8, that induced NASA to promote these four pilot systems. 

The discussion begins with a brief description of each system-- it’s mission, research user 
community, data sets, and test bed applications. Following that, the basic functional com- 
ponents of each system will be compared. This task is complicated by differences in terminol- 
ogy between systems and by differences in their hierarchical organizations, and by differences 
in the details in the documentation. Next, the individual functions will be covered in some- 
what greater detail (e.g., means of access, user interface, data manipulation and display, and 
other functions such as bibliographic search and on-line help). 

The primary intent of the material presented here is to gain a greater appreciation for 
the diversity of approaches in establishing the NASA data pilots and to gain an improved 
understanding of the fundamental differences and basic similarities in these systems. Further 
work will be required to address questions such as which parts of which systems are particu- 
larly well suited for a given task or function. Can standard approaches to problems evolve 
out of these systems; how can lessons learned from one system be incorporated effectively into 
others; what parts of these systems can be used to form an EOS data system; is that even 
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appropriate; and so on? 

2. Pilot Data System Descriptions 

Pilot Climate Data System 

The collection of climate research data has been a major focus within NASA. PODS 
was the first NASA data system to provide on-line capability to users. It was designed and 
scoped in 1980 and began test operations in June, 1982. The newest revision, Version 3.5, 
was installed in September, 1986. Current research applications being supported include Sta- 
tistical Climatology, International Satellite Cloud Climatology Project, Global Ozone Distri- 
bution, Earth Radiation Budget Studies, Land-Surface Climatology, and Solar Flux. By the 
end of 1987, it is expected that PCDS will be declared operational, with a name change to the 
NASA Climate Data System (NCDS). 

The goal of PCDS is to develop a system for locating, obtaining, manipulating, and 
displaying climate data of interest to NASA’s research community. Specific objectives are to 
develop a technically advanced system, to obtain and incorporate the most important data 
sets, and to support a broad range of users. 

PCDS operates on a VAX 11/780 at NASA’s Goddard Space Flight Center (GSFC), 
housing both the software and NASA and non-NASA data sets. The system can be accessed 
24 hours per day, seven days a week, except during scheduled maintenance. While PCDS has 
no specific science advisory panel participating in its management, the scientific community 
has interacted frequently, including two workshops held at GSFC. Mary G. Reph is the 
PCDS Project Manager. The contact address is: 

Pilot Climate Data System 

NASA Goddard Space Flight Center 

National Space Science Data Center 

Data Management Systems Facility 

Code 634 

Greenbelt, MD 20771 
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(301) 286-5037 


NASA Ocean Data System 

The data system to serve the oceanographic community began as the Pilot Ocean Data 
System (PODS) but was declared operational as NODS. The latest major system release was 
installed in February, 1986. 

NODS was designed to give the oceanographic community access to a variety of 
remotely sensed and in situ data and functions to preview the data. 

The system is managed at the Jet Propulsion Laboratory (JPL), currently providing 
remotely sensed data sets, both on-line and off-line and also on-line in-situ data sets. NODS 
is available 24 hours a day, seven days a week, except during scheduled maintenance. 
Current recommendations from the Advisory Panel of the Satellite Ocean Data Systems Sci- 
ence Working Group emphasizes that NODS should be part of a distributed data retrieval 
system and not focus on data analysis. Panel members felt that analytical functions are 
already available and that work on incorporating them into NODS would detract from the 
primary effort of data retrieval. The Project Manager is Dr. Eni Njoku and the Project 
Scientist is Dr. Victor Zlotnicki. The contact address is: 

The NASA Ocean Data System 

Jet Propulsion Laboratory 

MS 202-101 

4800 Oak Grove Drive 

Pasadena, CA 91009 

(818) 354-5036 

FTS 792-5036 

Planetary Data System 

PDS originally began as two separate tasks in 1983. The Pilot Planetary Data System 
was established to evaluate information systems and planetary science research methodolo- 
gies. The other task, called the Planetary Data System, began defining the requirements of 
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an operational planetary data system. The two projects were later merged and named the 
Planetary Data System. Version 1 of the system is scheduled to begin operating in mid-FY 
87, and Version 2 by the end of FY 89. 

The overall goal of PDS is to serve as curator of planetary data, providing both 
increased longevity of data and knowledge of their meaning, and to improve the organization 
of and access to these data in order to facilitate scientific investigation and analysis. A more 
specific objective of PDS is to develop and implement a system, based on the recommenda- 
tions of the Committee on Data Management and Computation (CODMAC), which can pro- 
vide easy access to planetary data and to information about the data to the scientific com- 
munity. 

Project responsibility is assigned to JPL, where the high-level catalog and archives are 
maintained on a VAX 11/780. It is designed to be a distributed system with a central node 
at JPL which maintains the high level catalog and the archives on a VAX 11/780. Discipline 
nodes, located at installations actively doing planetary research, maintain mature data sets 
and expert disciplinary knowledge. The discipline nodes are organized around four 
subdisciplines— geology, rings, atmosphere, and fields and particles. Mission nodes provide the 
interface between Planetary Flight Centers and PDS. A Scientific Working Group partici- 
pates in the oversight of PDS. Dr. William Kurth, a research scientist at the University of 
Iowa, has replaced Dr. Ray Arvidson as Project Scientist. Project Manager for PDS is Dr. 
Tom Renfrow at JPL. Contact address is: 

Planetary Data System 
Jet Propulsion Laboratory 
4800 Oak Grove Drive 
Pasadena, CA 91009 
(818) 354-6347 
FTS 792-6347 


Pilot Land Data System 
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Planning for a pilot data system to serve earth scientists began in 1983. Preliminary 
users requirements have been drafted. An initial operating version of PLDS, called Build 1, is 
scheduled to go on-line in July, 1987. Development will continue through 1989. The three 
test bed projects in the development phase are the International Satellite Land Surface 
Climatology Project Retroactive Analysis Project (ISLSCP-IRAP), the First ISLSCP Field 
Experiment (FIFE), and the Sedimentary Basins Project (SEDBAS). A fully operational Land 
Data System is anticipated to be on-line around 1991. 

PLDS was created to provide a limited scale demonstration of an information system 
that would offer scientists the ability to archive, locate, transfer, integrate, and manipulate 
data in a distributed fashion. 

PLDS is managed at NASA’s GSFC, although it is not yet accessible to users. Data 
storage will be distributed at existing archives such as EROS Data Center, USGS/ESDD, 
USGS/WATSTOR, and NOAA/NEDRIS. In the preliminary demonstrations of PLDS, scien- 
tists can access these archives but will not be able to query or command satellite sensors 
directly. This type of capability is being examined for the EOS time frame, however. Dr. 
Paul Smith at GSFC is Project Manager; Dr. Robert Price, also at GSFC, is Project Scientist. 
The address is: 

Pilot Land Data System 
NASA Goddard Space Flight Center 
National Space Science Data Center 
Data Management Systems Facility 
Code 634 

Greenbelt, MD 20771 
(301) 286-5037 

3. Functional Components 

Organizing a comparison of pilot data systems is not a clear cut task. The pilots are in 
different stages of development, (see Figure 1), so the level of specificity varies. 

Each system uses a slightly different terminology for similar functions. Some call a function a 
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Figure 1. Pilot Data System Development Timelines 


78 80 82 84 86 88 90 92 



component or a major subsystem, while others simply consider the same function to be an ele- 
ment of a larger component. In other words, the functional hierarchies are quite different. 
Indeed, throughout this entire area there is a real need to establish, in so far as possible, a 
glossary containing the agreed definitions of a variety of terms. Such a glossary would 
improve and facilitate communications among data system scientists and technicians, discip- 
line scientists, and researchers. To allow comparison, this paper focuses on and identifies six 
typical subsystems of a data system, plus an ‘all other’ category for miscellaneous and admin- 
istrative functions. These subsystems or components are: 

1. User Interface 

2. Catalog and Directory 
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3. Data Validation 

4. Data Retrieval 

5. Data Manipulation 

6. Data Display 

7. Other 

Each of these subsystems can in turn be broken down into subunits in a variety of ways. 
These will be discussed in more detail in the sections which follow. Table 1 summarizes these 
functional components of the four pilot data systems. Cases identified with an asterisk indi- 
cate that the system has some capability in the given function but that it is not considered a 
separate subsystem. 

Note that NODS has no subsystem for data manipulation. As mentioned above, the 
Science Working Group overseeing NODS found that at present, adequate capability in 
oceanographic data processing and analysis and wanted to see NODS focused on data 
retrieval. Also notice that there is no data acquisition subsystem listed above. In all four 
data systems, the mission planning aspects are detached from the data management func- 
tions. The focus of the pilot studies is on management of data from past and upcoming mis- 
sions, rather than on recommending missions to collect missing data. 

The details of each of these subsystems for each data system will be compared in the 
following sections. These comparisons should be taken with some caution, however. These 
are pilot systems, the details of which are still evolving. The absence of a particular capabil- 
ity or function indicates only means that, at this time, it was not found in the available docu- 
mentation. It cannot be asserted with certainty that the function has been consciously or 
unconsciously omitted by system planners. 

User Interface 
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Table 1. Functional Components/Subsystems 


Function 

PLDS 

NODS 

PCDS 

PDS 

User 

Interface 

* 

Menu/ 

Command 

Interface 

* 

Access 

Data 

Catalog/ 

Catalog, 

Global 

Catalog, 

Inspect 

Directory 

Directory 

On-Line 

Data 

Catalog 

Inventory 

Data 

Data Input/ 
Validation 


★ 

* 

Prepare 

Data 

Data 

Data 

Data 

Data 

Inspect 

Retrieval 

Reduction 

Archive 

Access 

Data 

Data 

Data 


Data 

Inspect 

Manipulation 

Reduction 


Manipu- 

lation 

Data 

Data 

Display 

Graphics 

Data 
Display 
& Product 
Generation 

Graphics 

Inspect 

Data 

Other 

* 

* 

* 

* 


* system has this capability but it is not as yet nor planned to be a major com- 
ponent 


PLDS = Pilot Land Data System 
NODS = NASA Ocean Data System 
PCDS = Pilot Climate Data System 
PDS = Planetary Data System 


The User Interface component generally refers to how a scientist can access the data 
system and interact with it. We will consider three aspects. First, the actual user interface- 
how the user tells the system what to do— is described. Then the variations in the means of 
accessing the system and the user hardware requirements are compared. 

As shown in Table 2, all four pilot systems allow both menu-assisted operations for 
beginners and a command language for experienced users. PLDS further proposes allowing 
natural language queries in some future revision. Most offer an on-line Help function, and 
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PCDS even has an on-line tutorial session. All four pilot systems allow queries by some com- 
bination of parameters such as date, location, data type, platform, sensor, and so on. How- 
ever, the details are not given to the same degree of specificity, so an adequate comparison 
cannot be made at this time. 


Table 2. 

User Interface Capabilities 


Capability 

PLDS 

NODS PCDS 

PDS 

Menu 

• 

• • 

• 

Command 

Language 

• 

• • 

• 

Natural 

Language 

? 



On-Line Help 


• • 

• 


PLDS = Pilot Land Data System 
NODS = NASA Ocean Data System 
PCDS = Pilot Climate Data System 
PDS = Planetary Data System 

The data systems vary in their means of access (see Table 3). All four will allow use of 
a packet-switching network like TELENET. All but NODS allow direct dial-up over a 
modem, and all but PLDS use the Space Physics Analysis Network (SPAN). PLDS and PDS 
will include a local area network, and PLDS anticipates future use of satellite communica- 
tions. 

Hardware requirements are intentionally kept simple (see Table 4). Most systems only 
require the user to have any ASCII terminal or a personal computer with 
emulator/communications software and a modem. A scientific workstation is the expected 
device for PLDS. Printers, plotters, or graphics displays are optional, although, of course, the 
capability to view output displays depends on some combination of them. 

Catalog and Directory 
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Table 3. 

Methods of Access 



Method 

PLDS NODS 

PCDS 

PDS 

Local Area Network 
at Central Node 

• 


• 

Local Area Network 
at Satellite Node 



• 

SPAN Network 

• 

• 

• 

TELENET Type Network 
(packet switching) 

• • 

• 

• 

Direct Dial/Modem 

• 

• 

• 

Satellite 

Communications 

? 




PLDS = Pilot Land Data System 

NODS = NASA Ocean Data System # 

PCDS = Pilot Climate Data System 
PDS = Planetary Data System 


Table 4. Required User Hardware 
Device Type PLDS NODS PCDS PDS 

Terminal 
or PC 

• 

• 

Modem 

• 

• 

Scientific 

Workstation 

• 


Printer 

* 

* 

Plotter 

* 

* 

Graphics 

Display 

♦ 

* 


• = required hardware 

* = optional hardware 


PLDS — Pilot Land Data System 
NODS = NASA Ocean Data System 
PCDS = Pilot Climate Data System 
PDS — Planetary Data System 


i 
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The catalog or directory component clearly illustrates the lack of uniformity in pilot 
data systems. PLDS tentatively has separate Directory and Catalog subsystems. The Direc- 
tory will contain detailed information about the data sets, while the Catalog will have more 
general descriptions about the data collections. The Global On-Line Data Catalog in NODS 
has two levels. The top level, comparable to PLDS ? Directory, can be queried on data param- 
eters such as platform, sensor, location, and time. The lower level includes more general 
information about the data sets, similar to PLDS’ Catalog subsystem. PCDS has two 
separate subsystems like PLDS does, but calls them Inventory and Catalog. In PDS, the 
Inspect Data subsystem includes a catalog function as well as the data manipulation and 
display functions. 


Data Validation 

Only PDS has a specific component for inputting and validating data. Quality control 
checks, specified by the planetary science community, are performed. Errors are corrected 
internally, when possible, or else the data are returned to the originator. Of course, the data 
system user will not interact directly with this subsystem. However, it will be crucial to the 
long-term success of future information systems that data sets are carefully chosen and that 
their quality is validated and described in the catalog. 

Data Retrieval 

Retrieving data from the archives is a significant aspect of each pilot system. Each has 
a subsystem dedicated to data retrieval, except for PDS which includes it in a comprehensive 
Inspect Data component. Basically, all systems allow queries by mission or platform, by sen- 
sor, by date and time, by geographic location, or by data type, or some combination of logical 
‘and’ and *or’ operators. 
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Data Manipulation 

The four pilot systems vary significantly in their emphasis on analytical functions (see 
Table 5). In PLDS, it is anticipated that functions for radiometric and geometric corrections 
and for image subsetting will be provided. Other functions such as statistics, modeling, image 
processing, and geographic overlay may also be available. NODS, on the other hand, will 
provide almost no analytical capabilities, but will focus instead on data base management 
aspects. PCDS is limited to customizing data sets for further processing at the user’s node or 
preparing data for the graphics subsystem using statistical functions. PDS will incorporate 
functions for statistical and mathematical analysis, sampling, and elementary image process- 
ing. The responsibility for software development in PDS is assigned to the discipline nodes. 


Table 5. 

Data Manipulation Capabilities 


Capability 

PLDS 

NODS PCDS 

PDS 

Statistics 

• 

• 

• 

Data Sampling 



• 

Math Functions 


• 

• 

Modeling 

• 



Image 

Processing 

• 


• 


PLDS = Pilot Land Data System 
NODS = NASA Ocean Data System 
PCDS = Pilot Climate Data System 
PDS = Planetary Data System 


Data Display 

The final output of the data systems is their real objective. All four systems identify 
this as an essential aspect by making a separate subsystem (although PDS includes other ele- 
ments within its Inspect Data subsystem). As shown in Table 6, the displays include tabular 
and textual data, graphics, imagery, and in NODS, bibliographic abstracts. Naturally, the 
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capabilities are dependent on the user's hardware, although custom products or tapes can be 
produced at the central node and shipped to the user. Some graphic products may necessarily 
be performed at the central node because of the transmission times. For instance, it would 
take 45 minutes to transmit a 512 x 512 pixel image with 8-bit pixels over a 1200 baud line 
(Jet Propulsion Laboratory, 1986). 


Table 6. 

Display /Graphics Capabilities 


Capability 

PLDS 

NODS 

PCDS 

PDS 

Text 

• 

• 

• 

• 

Abstracts 


• 



2D Graphics 

• 

• 

• 

• 

3D Graphics 



• 

• 

Imagery 

• 

• 


• 

Browse 

• 

• 

• 

• 


PLDS = Pilot Land Data System 
NODS = NASA Ocean Data System 
PODS — Pilot Climate Data System 
PDS — Planetary Data System 

Browsing capability is proposed for all four data systems. NODS will have Browse func- 
tions both for looking at examples in the Catalog subsystem and at data in the Archive sub- 
system. In PCDS, Browse refers to scanning detailed catalog descriptions. PDS will utilize 
optical videodisks for electronic Browse of planetary images stored at regional facilities. 

The role of a Browse function is unresolved and may well be different for each data sys- 
tem because of the type of data and because of differences in user requirements. The tradeoff 
between storage of large numbers of preprocessed images and the time time of processing only 
on request need to be examined. Related to this issue is determining what types of browsable 
imagery the scientific community wants, realizing there are a vast number of possibilities. 
These issues are being addressed in a current research study at the Remote Sensing Research 
Unit of the Department of Geography, University of California, Santa Barbara. 
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Other Functions 

This section describes the capabilities of the pilot data systems in several miscellaneous 
areas, including bibliographies, electronic mail, and administrative details (see Table 7). 



Table 7. Other Functions 


Function 

PLDS NODS 

PCDS 

PDS 

Bibliography 

* Biblio- 

graphy 


★ 

Electronic 

Mail 

* 


* 

Adminis- 

tration 

* * 

* 

Order Data, 
Distribute 
Data, 

Administer 

Data 


* system has this capability but it is not as yet or planned to be a major com- 
ponent 

PLDS = Pilot Land Data System 
NODS = NASA Ocean Data System 
PCDS = Pilot Climate Data System 
PDS = Planetary Data System 

PLDS and PDS will have some bibliographic data available. Only NODS, however, 
actually has a separate subsystem just for bibliographic data. The bibliography can be 
queried by author, title, subject, and so on, with the results of the query sent to the user. 

Both PLDS and PDS anticipate some form of electronic mail to allow communications 
between users and between the system and the users. For instance, news of system revisions 
and new data sets can be distributed to all users. 

Administrative functions such as ordering and distributing data will be provided by all 
four systems. PDS emphasizes this aspect with three of its five subsystems directed towards 
these administrative details. 


University of California, Santa Barbara 



-16- 


Draft— 1/87 


4, Conclusions 

The four pilot data systems in NASA have been developed more or less in parallel and 
somewhat independently. They are being implemented by different NASA centers and over- 
seen by separate scientific steering committees. Hopefully, this report has provided a greater 
appreciation for the diversity of approaches in developing the NASA data pilots and for the 
differences and similarities in the products. 

A major task remaining prior to integration of the pilots into an Earth Observations 
Information System is to evaluate each of pilot system. Important questions remain as to 
whether researchers will actually use and benefit from distributed data systems. The develop- 
ment of a Browse capability, currently underway at the University of California, Santa Bar- 
bara, will be crucial for allowing scientists to preview data at their local terminals to deter- 
mine if it meets their needs. To accomplish this task will first require a survey to find out 
how researchers decide when data is adequate, e.g. by visually inspecting raw or transformed 
imagery, by summary statistics, etc. 

Nevertheless, NASA has already made great strides towards designing a vehicle to make 
satellite data accessible by researchers in the land, ocean, climate, and planetary sciences. 
The wealth of data from past and future missions may begin paying dividends far beyond the 
scope of the original research mission objectives, particularly when data from different sources 
can be integrated synergistic ally. 
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In particular, with new aircraft sensors such as AIS and AVHRIS, 
and proposed spaceborne sensors such as HIRIS, future image 
analysts will have many more spectral channels than In the past, 
and Tnus, many more possible band ratios. This paper examines 
the deslreable characteristics of band ratios, and suggests 
algorithms for their production. 
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INTRODUCTION 

Band ratioing is an image enhancement procedure in 
which the radiance values of two bands of a multispectral 
image are divided, one by the other on a pixel by pixel 
basis, in order to produce a pseudoimage known as a "ratio 
image". Descriptions of specific band-ratioing procedures 
are scarce in the literature, and it is clear that many 
researchers have produced ratio images that fall short of 
maximizing overall information content and interpretabi lity . 
This paper reviews the logical basis for band ratioing, 
discusses the desirable characteristics of ratio images, and 
compares several effective procedures for the production of 
them. 

The use of ratio images has been widespread and highly 
touted in the analysis of remote sensing imagery data, 
especially for lithologic studies (e.g., Vincent, 1972; 

Rowan et al., 1974,1977,1982; Goetz et al., 1975; Raines et 
al., 1978; Podwysocki et al., 1983) and vegetative studies 
(e.g., Maxwell, 1976; Tucker, 1979; Jackson et al., 1983). 
Band ratios enhance image information regarding the spectral 
reflectance properties of surface materials by subduing 
those factors of surface radiance that are correlated 
between bands, namely albedo and illumination intensity. 


Useful ratio images are made from two bands in which 
the various features of interest differ non-proportionally 


in their spectral reflectivities. These bands can be 



selected on theoretical grounds from knowledge of target 
spectra, or by evaluating test images of selected sites. 

The topics of band and ratio selection are specifically 
addressed by Crippen et al. (1987). The concern of this 
paper is instead the calculation and digital representation 
of the ratio image. 

RATIONAL FOR BAND RATI 01 NG 

The logical basis of digital band ratioing was 
presented by Kreigler et al. (1969). It can be readily 
illustrated. On a bi-spectral plot (Figure 1) pixels of a 
homogeneous surface material in rugged terrain generally 
fall along a straight line. The slope of such a line 
approximately equals the ratio of the two band 
reflectivities for the surface material, and the position of 
a pixel along the line is a function of the solar 
illumination intensity for that pixel. If pixel values are 
calibrated so that zero in each band represents zero ground 
radiance (the line projects through the graph origin), then 
the ratio values of the pixels along the line are fairly 
uniform, equal to the line slope, and independent of 
illumination intensity. Since spacial variations in 
illumination intensity are largely controlled by topographic 
orientation relative to the sun position, band ratios are 
largely free of topographic effects. Conversely, since 
pixels of materials of differing reflectance properties 
commonly plot along lines of differing slopes, ratio values 



are strongly related to surface material composition. 

Figure 2 provides an example of two bands and their ratio 
image. 

Where topography is not prominent, band ratioing can 
still be useful since it also reduces spatial variations in 
signal strength attributable to heterogeneities in 
atmospheric transmittance and to look-angle effects. Such 
variations will be removed by band ratioing to the extent 
that they are constantly proportional between bands across 
the image. 

The logical basis of ratioing can also be described in 
terms of the various components of sensor-recorded radiance 
measurements. Equation 1 (after Kowalik et al., 1983) 
describes, to the first order, a radiometric measurement by 
a nadir-viewing sensor. The measurement consists of four 
components *: 

(a) ground reflectance of direct solar illumination 

(b) ground reflectance of skylight 

(c) atmospheric path radiance 

(d) sensor calibration offset 

The first two components are modified by atmospheric 
transmittance, and all components are adjusted by the sensor 
gain factor. It is assumed that the ground is a lambertian 
surface . 


S’ • 




C 


« 


f 



footnote 


* If one defines pixels as rectangular areas having 

sharp boundaries, then a pixel "cross-radiance" component 
must also be included. If one instead accepts the 
indistinct, overlapping, true spatial character of pixels 
(as determined by both sensor and atmospheric optics), then 
this additional component is meaningless. 


S T 0 Hd COS OC S T A Hs 

L * + + S A + SO 

ir ir 


[i] 


L = 
S = 
T = 

/> = 
Hd = 

cos oC * 
HS = 
A = 
0 = 


recorded radiance 
sensor gain factor 
atmospheric transmittance 
surface reflectance 
direct illumination intensity 
cosine of the incidence angle 
skylight illumination intensity 
atmospheric path radiance 
sensor calibration offset 


All variables are specific to each wavelength band except 
the ground surface orientation term, cosoc. 

The underlying premise of band ratioing is that 
recorded radiances can be adjusted so that their ratio is 
approximately proportional to the ratio of surface 
reflectances , (for two bands "a" and "b"). Clearly, 



the approximation is unlikely to be close if the atmospheric 
path radiance or sensor calibration offset components are 
significant. The approximation is good, however, if (1) the 
direct illumination, atmospheric transmittance, and sensor 
gain are essentially uniform for pixels within each band, or 
are constantly proportional across the bands to be ratioed, 

(2) skylight either provides a negligible contribution to 
total illumination or has spectral proportions similar to 
the direct illumination, and (3) the atmospheric path 
radiance and sensor offset components can be estimated and 
removed (discussed below). If these conditions hold then 
all terms other than reflectance will reduce to a constant, 
c, by which the ratio of recorded radiances, adjusted by 

(3) , are linearly related to the ratio of surface 
reflectances (Equation 2). 


ratio of La-(SaAa+SaOa) /*a 

adjusted = = c — [2] 

radiances Lb- (SbAb+SbOb) /^b 


for bands "a" and "b" 

Since the cos oC (solar incidence) terms are equivalent 
between bands, they are eliminated by band ratioing (given 
the above stated conditions). Thus, unlike the original 
bands, properly derived ratio values are generally not 
correlated with topography. Instead, ratio values are 
almost entirely a function of surface reflectivities and, 
therefore, surface composition. 



RATIO IMAGE SIGNAL-TO-NOI SE RATIOS 


Many researchers have noted that noise commonly appears 
greater in ratio images than in the original bands. This is 
largely attributable to a proportionally greater reduction 
in image signal than image noise during the ratioing 
process. Image signal typically has a strong positive 
correlation (such as in Figure 1). Ratios therefore measure 
the bi-spectral data in a dimension that commonly has 
minimum signal variability. Noise, on the other hand, 
typically shows no interband correlation and measurements of 
it in the "ratio dimension" are approximately equal in 
magnitude to the average noise in the two original bands. 
Thus, ratioing reduces the signal but not the noise, and 
this results in a smaller signal-to-noise ratio. 
Additionally, some new quantization noise is added when the 
ratio values are represented by integers. 

THE IDEAL GENERAL-PURPOSE RATIO IMAGE 

In general, the following characteristics are desirable 
for ratio images: 

(1) The input radiance values should be corrected for 
atmospheric and sensor-offset additive-radiance terms so 
that the ratio values are closely representative of the 
reflectance properties of surface materials. 

(2) The distribution of ratio values should be positioned 
and scaled to fill the quantization range, with only minor 



saturation at the range limits. 


(3) The distribution of ratio values should tend toward 
histogram equalization (histogram flattening) so as to 
maximize, on average, the distinguishability of each pixel 
from all other pixels (maximize image entropy). 

(4) The ratio image should be invertible, so that 
information content does not change when the numerator and 
denominator bands are exchanged. Being invertible means 
that a ratio image is the negative of the ratio image 
produced by inverting the ratio. Invertible ratios are, on 
average, symmetrical about a central (median or mean) value, 
and deviations from this symmetry are attributable to image 
features and not to the ratioing procedure. 

In cases where only anomalously extreme values are of 
interest, some of these characteristics (such as histogram 
equalization) may not be desireable. However, for general 
purpose ratioing, the above characteristics tend to maximize 
information content and present it in a readily 
interpretable form. 

RAW RATIOS: PROBLEMS 

An inherently problematic yet still common procedure 
for producing ratio images is to divide the raw (unadjusted) 
pixel values of one band by those of another band to produce 
a "raw ratio" (which is then multiplied by a scaling 
factor). This procedure fails to compensate for the 



additive terms in Equation 1. Since these terms are largely 
independent of the reflectance properties of the surface 
materials, the resultant ratio values are not necessarily 
closely related to surface composition. As expressed 
algebraically in Equations 3a and 3b, ratio values are not 
constant, despite proportionally equal surface radiances (Ra 
and Rb) , if the additive terms (Aa and Ab) are not removed. 


Ra 

15 

30 

45 





Rb 

10 

20 

30 





but 








Ra + 

Aa 

15 

+ 8 
+ 3 

30 + 

8 

45 

30 

+ 8 

Rb + 

Ab 

10 

20 + 

3 

+ 3 


[3a] 


[3b] 


Commonly, the additive terms are attributed mostly to 
atmospheric path radiance, however the sensor calibration 
offset is commonly the larger term in the infrared 
wavelengths. The skylight term is only significant for 
shaded pixels (Kowalik et al., 1983), where ratio values are 
crude anyway due to relatively large signal quantization 
errors. 


Several methods are available for estimating the 
combined atmospheric and sensor-offset correction term, 
including Dark-Pixel Subtraction (Crane, 1971), 
Radiance-to-Ref lectance Conversion (Honey et al., 1974), 
regression-type techniques (Potter and Mendlowitz, 1975; 
Switzer et al., 1981; Crippen, 1986), and a statistical or 



visual i terative-ratioing procedure (Crippen, 1986). 

Examples of raw ratios (and ratios for which the 
additive terms have been incompletely removed) are plentiful 
in the literature. They are readily recognized by their 
significant retention (sometimes reversal) of topographic 
effects (such as in Figure 2c). The interpretability 
problems of raw ratios are easily explained by graphic and 
pictoral illustration. 

Figure 3 is a simplified bi-spectral plot depicting two 
contrasting ground cover types in rugged terrain. Data 
adjustments for the additive-radiance terms have not been 
made, thus the plots do not project through (and intersect 
at) the graph origin. Ratio values increase circularly on 
the plot in the counter-clockwise direction. Although the 
lines represent spectrally homogeneous materials, ratio 
values along each of them are greater for pixels of low 
illumination intensity. Also, dark pixels of material U 
have the same ratio values as brighter pixels of material M, 
and are therefore indistinguishable. Note that this is not 
the case when the additive terms (45 and 6 for bands 1 and 4 
respectively) are removed, as in Figure 1. Note also that 
these plots are derived from the image data of Figures 2a 
and 2b and explain the topographic reversal and spectral 
ambiguity of Figure 2c. M is freshly exposed bedrock and 
mining talus (high TM 1/4 ratio) and U is the adjacent 
undisturbed bedrock with coatings of desert varnish and 
other products of weathering (low TM 1/4 ratio). 



Problems attributable to the use of raw ratios have 
been cited by Krohn et al . (1978) and Kowalik et al. (1983) 
in attempts to discriminate limonitic outcrops from 
non-limonitic outcrops. In other studies this cause of 
feature discrimination problems has gone unrecognized by the 
authors . 

Some researchers have argued for the use of raw ratios 
because retained topographic information can provide 
benefits in visual image analyses. Most, however, have not 
acknowledged that these benefits must come at the expense of 
spectral distinctiveness. Others have argued that raw 
ratios have a greater signal-to-noise ratio than 
additive-term-adjusted ratios, however this is only true if 
one defines atmospheric path radiance to be "signal". For 
most applications it is more reasonably defined to be 
environmental noise. Clearly, removal of the additive terms 
is essential if ratio images are to coherently depict 
surface reflectance properties. 


SIMPLE RATIOS: PROBLEMS 

Ratios produced by simple division of 
additive-term-adjusted radiance data are closely related to 
the reflection properties of surface materials, and are 
herein termed "simple ratios". The formula for simple 
ratios is 


V 


( (Rn/Rd) * S ) - C 


[4] 



where V is the ratio value, Rn and Rd are the pixel values 
of two bands, S is a scaling factor that spreads the results 
over 256 quantization levels (with minor saturation), and C 
is an offset term to position those levels at 0 to 255. For 
general purpose use, problems remain for simple ratios. 

One problem is that simple ratios show an awkward 

complexity in that greater contrast is given to surface 

materials that happen to have greater data values in the 

band selected to be the numerator. This is because simple 

ratios less than unity (numerator less than denominator) 

have a range of zero to one (before scaling), while those 

greater than unity have a range essentially from one to 

infinity. A related problem is that while simple ratio 

outputs greater than unity are distributed linearly (2/1, 

♦ 

3/1, 4/1, ...), their equivalent inverses less than unity 
are distributed hyperbolically (1/2, 1/3, 1/4, ...). 

Another problem with simple ratios is that their 
distributions are commonly sharply peaked and the peak can 
only be finely quantized if the output scale is increased to 
the point where many non-peak values are saturated at the 
range minimum and maximum. 

As will be shown below, solutions to these problems are 


not difficult 


BAND RATI 01 NG FORMULAS 


A wide variety of formulas can be used to produce ratio 
images that possess all of the previously stated desireable 
characteristics and avoid or reduce the problems of simple 
ratios. Several are presented below. In each it will be 
assumed that the data have been corrected for the additive 
terms. The following symbols are used: 

V = Ratio value 
Rn = Numerator-band pixel value 
Rd = Denominator-band pixel value 
P = Proportionality factor (constant) 

S = Scaling factor (constant) 

Except where noted, all of the ratio formulas discussed 
below are invertible and provide a generally advantageous 
quantization that relatively compresses (crudely quantizes) 
extreme values and expands (finely quantizes) values near 
unity. Since between-band differences in the overall 
magnitudes of band values have no environmental meaning 
(being a function of sensor gain settings), any ratio value 
can be reset to unity to take advantage of the fine 
quantization there. Commonly, the user will want to 
increase overall pixel discr iminability (image entropy) by 
positioning the ratio modal value at unity (many ratio 
images are distinctly unimodal). Rn/Rd (the ratio of the 
average additive-term-adjusted radiances) is a good first 
estimate of the ratio mean and mode values. Its inverse, 


Rd/Rn, is therefore useful as a proportionality factor (P) 
for centering the ratio values approximately at unity before 
non-linear quantization. By selecting an appropriate 
proportionality factor, any range of ratio values can be 
emphasized in the quantization. Again, the proportionality 
factor is just the fractional inverse of a selected value in 
that range. 

The scaling factor, S, is determined by making a low 
guess, producing the ratio image and its histogram, and then 
increasing S so that the output ratio image values fill the 
quantization range with only minor saturation at the minimum 
(0) and maximum (255) values. If saturation is (or projects 
to be) one-sided and excessive then the midrange constant 
representing unity (initially 128) can be adjusted to 
reposition the ratio distribution. Only one trial iteration 
should be necessary, and it may be economically beneficial 
to grid-sample the image for producing the trial histogram. 

Note that ratios having quantization stretches that are 
symmetrical around unity, can be inverted: 

band 1 band 2 

> 

band 2 band 1 

by linear reversal: 

output value = range maximum - input value 

when unity is assigned to the midrange value. Thus, such 
ratios inverted on a display device (by reversing the lookup 


table) are numerically equivalent to the same ratios 
calculated with the numerator and denominator bands 
interchanged. 

All ratioing formulas presented will be evaluated and 
compared by ratio modelling later in this paper and are only 
briefly described here. 

Log Ratios 

Logarithmic ratios, first introduced by Goetz et al. 
(1975), are derived by use of Equation 5. 

( ( ( Rn ) ) ) 

V = 128 + ( ( log ( — * P ) ) * S ) [5] 

( ( ( Rd ) ) ) 

Note that log ratios are inherently invertible around 
unity because logs of ratios equal the negatives of the logs 
of inverted ratios. 

Arctangent Ratios 

Arctangent ratios were first proposed by Wecksung and 
Breedlove (1977) as another solution to the quantization 
problems of simple ratios. Equation 6 is a practical 
formula for their derivation. 

( ( ( Rn ) ) ) 

V = 128 + ( ( atn ( — * P ) - 45 ) * S ) 

( ( ( Rd ) ) ) 


The resultant values are linearly related to the 



angular position of the pixels as plotted on a bi-spectral 
graph (after proportionality-factor adjustment). 

Normal ized-Dif ference Ratios 

Normalized-dif ference ratios (Smedes et al., 1971) are 
very similar to log and arctangent ratios, yet are derived 
without the use of a complex function (Equation 7). 

( ( Rn - Rd ) ) 

V = 128 +( ( ) * S ) [7] 

( ( Rn + Rd ) ) 

In this case, however, to assure that the greatest 
stretch is applied to the selected ratio value, one band 
must be adjusted relative to the other before division 
occurs. This is achieved by substituting Rn*P for Rn in the 
formula. 

The so-called Transformed Vegetation Index (TVI ) is a 
modification of a normalized-dif ference ratio that uses a 
red band, Rn, and a near-infrared band, Rd (Rouse et al., 
1974). The formula is: 

( ( ( ( Rn - Rd ) ) ) ) 

V = ( ( SQR (( )+K))*S)+C [8] 

( ( ( ( Rn + Rd ) ) ) ) 

where K=1 is used to avoid taking the square root of a 
negative number (constants less than 1 can be used but 
cannot be pre-determined to work for all pixel values). C 
is negative and is used to position the values between 0 and 
255 after scaling. With K=l, the square-root function 


asymmetrically compresses the normalized-di f f erence values 
toward unity, Rn/Rd=l (Rn/Rd < 1 becoming larger and Rn/Rd > 

1 becoming smaller). If K=/=l, however, then the 
compression is toward some value other than unity (for the 
commonly used K=0.5, compression is toward Rn/Rd * 3). No 
logical quantization advantage is apparent for TVI and 
several authors have reported no practical benefit over 
simple ratios for measuring vegetation (e.g. Tucker, 1979; 
Perry and Lautenschlager , 1984; Yool et al., 1986). Its 
continued use therefore appears unjustified. 

A square-root function can be forced to be symmetric, 
to take advantage of it exponential quantization for general 
purpose ratioing, by using a dichotomous formula (Equations 
9a and 9b) . 

If Rn/Rd >= 1 then: 

( ( ( ( Rn - Rd ) ) 

R = 128 + ( ( ( SQR ( + 1 ) ) 

( ( ( ( Rn + Rd ) ) 

I f Rn/Rd < 1 then : 

( ( ( ( Rd - Rn ) ) ) ) 

R * 128 - ( ( ( SQR ( + 1 ) ) - 1 ) * S ) [9b] 

( ( ( ( Rd + Rn ) ) ) ) 

It will be shown later by modelling, however, that the 
square-root function has little effect on the quantization 
of normalized-diff erence ratios. 


) ) 

- 1 ) * S ) [9a] 

) ) 



Equilateral and Hyperbolic Ratios 


When only the extreme values of a ratio image are of 
particular interest, it is beneficial to avoid compressed 
far-range quantizations. Equilateral ratios (Equations 10a 
and 10b) provide a quantization that is symmetrical around 
unity and is equal to the greater-than-unity quantization of 
simple ratios. 


If (Rn/Rd) *P >= 1 then: 


( ( ( Rn ) 

R = 128 + ( ( ( — * P ) 
( ( ( Rd ) 


) . ) 
1.00 ) * S ) 
) ) 


[10a] 


If ( Rn/Rd )*P <1 then: 


( ( ( Rn ) 

V = 128 - ( ( ( — * P ) 
( ( ( Rd ) 


) ) 

1.00 ) * S ) 
) ) 


[10b] 


By exchanging the if-then conditional statements, the 
quantization is still symmetrical but is equal to the 
relatively hyperbolic, less -than-unity quantization of 
simple ratios. In extreme contrast to equilateral ratios, 
this quantization is even more compressed in the far ranges 
than that of any other ratio discussed above. These 
quantization differences between equilateral and hyperbolic 
ratios are directly related to the great asymmetry of simple 
ratios. See the Modelling section below for histogram 
depictions of these quantization differences. 



DEALING WITH VERY DARK PIXELS 


A problem that can arise in ratios of any form is a 
light and dark, "salt and pepper" effect (figure 2d) in 
areas of very low terrain illumination (topographic 
shadows). This results where sensor noise, signal 
quantization noise, and ratio quantization noise combine to 
become large relative to the recorded signals. Thus, in 
areas of low illumination (areas of weak signal), ratio 
values can be so crude as to be virtually meaningless. 
Removal of these pixels from the ratioing process may be 
desirable. This can be done by assigning them a "neutral" 
value (perhaps 127 or 128), or a value chosen to be readily 
distinguishable in the output image (e.g., zero). The 
cutoff value for acceptible versus unacceptibly dark pixels 
is sensor, scene, and feature dependent, in addition to 
being a matter of judgement. In general, however, ratios 
using adjusted radiance values of less than 10 will produce 
results of questionable quality. 

As a minimum requirement, pixels with band values of 
zero or less must be assigned some positive value before 
ratioing to avoid the mathematical impossibilities of 
division by zero or the taking of a log of a negative 
number. (Values of less than zero are possible after 
removal of the additive term when (1) a pixel is in deep 
shadow and the additive-term correction value for the scene 
is excessive for that specific pixel, or (2) when sensor 
noise results in a falsely low recorded radiance value.) 



DISCUSSION 


The need for using invertible-ratio formulas is 
directly related to the magnitude of the range of ratio 
values. When the range is small, then the differences 
between the scale of simple ratios and the scale of their 
inverses is small. Note, for example, that 1. 1/1.0 and 
1. 0/1.1 are nearly equidistant from 1/1, but that 4/1 and 
1/4 are not. Thus, invertible ratios are especially useful 
for removing scale distortion when utilizing bands that are 
poorly correlated. In general, these are also the bands 
that provide the most information when ratioed. 

In lithologic studies using Landsat MSS data, simple 
ratios have generally been an adequate ratio form because 
all bands are quite highly correlated. With Landsat TM data 
and data from broad-spectrum airborne scanners this is 
commonly not true, and the benefits of invertible ratios can 
be substantial. 

Non-linear ratio quantizations are generally beneficial 
because many ratio-image histograms are strongly unimodal. 
The non-linear quantizations described above compress the 
distribution of ratio values in the extremes, and expand it 
near unity. Thus, if the mode is positioned at unity, the 
quantization is finer where most of the results lie, and the 
average discriminability of each pixel from all other pixels 
is increased (image entropy is increased). This is not 
equivalent to distribution stretches applied after the 


ratioing procedure, which can only reduce discriminability . 
When the user is not concerned about overall 
discriminability but instead wants to increase the spectral 
discriminability of a selected range of ratio values, then 
non-linear quantization is again beneficial when the 
proportionality factor is properly chosen. 

The differences between various types of ratios will 
usually be more significant statistically than visually. 

For statistical use (such as an input channel to a 
classification algorithm), any increase in pixel 
discriminability is potentially beneficial. For viewing, 
however, pixel discriminability is subject to perceptual 
constraints. The number of differing grey levels that can 
be perceived in a scene is severely limited, varying 
inversely with scene complexity from about 8 to more than 
100. For a spatially complex ratio image, 256 grey levels 
are an amount of detail far beyond what can be perceived, 
therefore just how those levels are used is not very 
important. The impact of quantization type on perception 
should not be ignored, however, since the signal 
(noise-free) components of ratio images generally do not 
have high spatial complexity and, therefore the potential 
exists to perceive several grey levels. Also, since users 
commonly interactively stretch narrow ranges of ratio values 
in displays to emphasize the detail of selected features, it 
is beneficial to have superior quantization detail 
available. 



MODELLING OF THE RATIO OUTPUTS 


In order to demonstrate the differences between the 
various types of ratios, a hypothetical ratio image was 
created with the ratio values distributed about a central 
value of 1/1 (unity). This modelling was necessary in order 
to demonstrate effects of the differing ratioing methods 
independent of any scene-specific features. The 
distribution of ratio values used for the modelling is 
arc-normal. That is, if the pixel values are plotted on a 
bi-spectral graph and their values are taken as the slopes 
(in degrees or radians) of the lines connecting them to the 
graph origin, then the histogram of those values has a 
gaussian distribution. Real ratio images commonly have 
distributions not unlike those created by this modelling. 
That is, when expressed in terms of simple ratios (instead 
of arcs), their distributions are unimodal and distinctly 
peaked. 

The differences in the ratio outputs are easily seen in 
their histograms. All outputs were scaled to have 2% 
saturation equally split between the range ends. 

In Figure 4a, simple ratio values are not distributed 
symmetrically and are sharply peaked below unity. The 
output image is therefore not invertible, and much 
information (in terms of the distinguishability of pixels) 
is lost. 

Equilateral ratio values are symmetrical about the 



central value, and the ratio image is therefore invertible. 
However, the histogram is sharply peaked such that several 
pixels are grouped and indistinguishable (image entropy is 
low). On the other hand, values significantly different 
from unity have a finely resolved quantization. 

Hyperbolic ratio values are also symmetrical and 
invertible, but show a quantization stretch that is so 
severe at unity that it splits the arc-normal model 
histogram into a bi-modal distribution. Such a stretch 
could be advantageous for ratio images that are more peaked 
than the arc-normal model. 

In Figure 4b, normalized-dif f erence ratios, TVI ratios, 
and square-root normalized-dif ference ratios are shown to be 
similar, except for the non-invertibility of TVI. Note that 
the square-root function has only a minor effect on the 
peakedness of normalized-dif ference ratios. 

In Figure 4c, arctangent, normalized-dif ference, and 
log ratios all display flattened distributions that are 
symmetrical (and therefore invertible) around their central 
value. These ratios are clearly superior to simple ratios 
in maximizing overall pixel discr iminability , given the 
peakedness common in ratio image histograms. 

CONCLUSIONS 

Procedures are not complicated for maximizing the 
utility and overall information content of band-ratio 



images. Simply put, radiance values need to be calibrated 
to ground reflectance, and then ratio values need to be 
optimally distributed within their quantization range to 
maximize pixel discriminability . For general purpose 
ratioing, invertible quantization is preferable so that the 
resultant image information is not dependent upon the 
assignment of the bands to the numerator or denominator. 

This paper has reviewed the logical basis of band 
ratioing, discussed the desireable characteristics of ideal 
general-purpose ratio images, and described several 
potentially-usef ul ratioing formulas. Arctangent, 
normalized-dif ference , and log ratios are very similar, are 
easy to employ, and are generally preferred over simple 
ratios due to their invertibility and commonly-advantageous 
non-linear quantization of the resultant values. 
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Band b 


Figure 1. Bi-spectral plot of two surface materials of differing 
reflectance properties in rugged terrain. Pixel values are 
calibrated so that zero in each band represents zero ground 
radiance. The materials are readily distinguished by their 
ratio values despite having pixel values that greatly overlap 
in the individual bands. 
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Figure 2. Landsat TM imagery of the Eagle Mountain Iron 
Mine and vicinity, Southern California, 12 December 1982. 
(a) Band 1. (b) Band 4. (c) Raw ratio 1/4 showing 

reversal of topographic expression, (d) Log ratio 1/4 with 
additive-radiance terms removed. Freshly exposed bedrock 
and talus are clearly depicted as bright areas in d, but 
remain confused with adjacent areas in c due to the 
preservation of topographic effects. 
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Figure 3. Simplified ‘bi -spectral plot of image data from Figure 2. • 

Raw ratios do not differentiate the surface materials since 
their values are equal for brightly illuminated mine pixels 
and darker pixels of undisturbed ground. Also, dark pixels 
of both surface materials tend to have greater raw ratio values 
than brightly illuminated pixels, resulting in the reversal 

of the topographic effect as seen in Figure 2c. Removal of • 

the additive terms (45 and 6) results in distinct ratio values 
for the two spectrally distinct materials (as in Figures 1 and 2d). 
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Figure 4. Histograms of arc-normal modelled 
ratio image. Unity is at the mid-range value for 
all symmetric (invertible) ratios and is indicated 
by a vertical line for simple ratios and TV I. 
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The Jargon and technology of remote sensing research and 
appl icatlons move rapidly. One of the problems we have 
discovered In recent workshop meetings Is a difference In the way 
people are using a variety of terms. The following list of terms 
and abbreviations Is cu I led from a variety of sources, 
principally In non-reterreed sources. 


In the next year, we propose to continue circulate this list 
among a number of our NASA-sponsored colleagues. The Intent of 
this activity Is to develop a concensus definition of the terms, 
to further communication within the profession. 
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GLOSSARY 


AAG - Association of American Geographers. 

Accuracy - A measure of freedom from error. 

Active System - In remote sensing, a system that is the source of 
tne electromagnetic radiation reflected or scaTTered by tne 
object being sensed. For example, radar Is an active microwave 
system (compare with "passive system"). 

AGIS - Automated Geographic Information System. 

AgRlSlARS - Agriculture ana Resources Inventory Surveys Through 
Aerospace Remote Sensing. 

AGU - American Geophysical Union. 

AIAA - American Institute of Aeronautics and Astronautics. 

AID - Agency for International Development (U.S. federal agency). 

Algorithm - A procedure for performing a specific action; In 
software this Is a set of Instructions to tne computer. 

Alphanumeric - Consisting of both letters and numbers. 

Alphanumeric and graphics terminals - Usually a black and whiTe 
TV screen with attached keyboard, or a keyboard terminal with a 
graphics printer, for Interacting with a computer. 

Analog - the representation of numerical quantities by means of 
physical variables (voltage, current, etc.) as opposed to 
"digital". 

Analog Computer - A calculating device that operates on numbers 
represented as measurable quantities (e.g., voltages) rather than 
the numbers themselves (compare with "digital computer"). 

Analyzed Data Records (ADRs) - Those records which the 
Investigator designates as the best to display the scientific 
results of an experiment and provide the physical quantities by 
applying calibration curves or algorithms to tne corrected 
observed quantities of the Reduced Data Records. The data may be 
time averaged and may Incorporate mode I -dependent assumpnons to 
obtain the physical quantities. Charts, graphs, table, 
correlation coefficients, model parameters, phoTographs, and 
plots are possible forms of these records. 

Ancillary Data - Additional, supplemental daTa. 

Angular Field of View - See Field of View (FOV) . 

Antenna - The device that radiates electromagnetic radiation from 
a transmitter or receives radiation from other sources or 
antennas . 
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Array Processor - A specialized computation unit used In 
connection with a regular computer CPU, which performs operations 
an arrays (matrices) of data. Used when data arrays must be 
manipulated at high speed. 

ASCS - Agricultural Stabilization and Conservation Service (USDA) 
ASP - American Society of Photogrammetry . 

B/W - Black and White 

Band - A range of wavelengths of electromagnetic radiation 
specified to produce a single response on a sensing device; also 
known as a channel . 

Beam Width - A measure of the concentration of power by a 
directional antenna. It Is the angle subtended at antenna by 
arbitrary (normally one-halt the maximum) power-level points 
across the axis of the beam. 

BIL - Band Interleaved by Line 

Bit - In digital computing, a binary digit which may either be 
one or zero. 

BPI - Bits Per Inch 

BPS - Bits Per Second 

BSQ - Band Sequential 

Byte - A collection of 8 bits so that one byte can represent any 
number between 0 and 255. 

Calibration - the act of comparing certain measurements in an 
Instrument with a standard. 

Cartography - The science of representing graphically the known 
physical features of the Earth's surface, whether natural or 
arti tlclai . 

Catalog - A collection of detailed Information about whole data 
sets . 

Cathode Ray Tube (CRT) - An electron tube whose face Is covered 
by a phosphor that emits light when energized by Its electron 
beam; for example, a computer display screen or a television. 

CBD - Central Business District 

CCRS - Canada Centre tor Remote Sensing 

Cell Based Data Structure (CBDS) - A logical organization of 
spatial data in which geographic space Is modeled as a surtace 
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composed of cells. 

Census Tract - Small areas averaging 4000 In population ana 
varying In land area. They represent neighborhoods having 
similar socioeconomic characteristics and are defined by 
local committees In cooperation with the Bureau of the 
Census. 

Central Processing Unit (CPU) - That part of the computer 
hardware which performs the actual calculation and manipulation 
of data. 

Channel - In remote sensing, a narrow spectral Interval; also 
referred to as a band. 

CIk - Color Infrared Film 

Co- I nvest i gator (Co-1) - An associate of the Principal 
Investigator (PI) who is assigned a supporting role In the 
Investigation. In addition, some data rights may be assigned to 
the Co- I by the P- I . 

Compatible - Executable on more tnan one computer. 

Computer Compatible Tapes (CCTs) - Tapes containing data In 
machine-readable (digital) format. Four Landsat CCTs are 
required to cover one Landsat scene. 

Contour Map - A topographic map that portrays relief by the use 
of contour lines of equal elevation. 

Contrast - The amount of difference between tne Intensity of 
light areas and dark areas of an image. The highest contrast 
ratio is between black ana whiTe. 

Contrast Enhancement - A mathematical procedure usually performed 
by a computer to artificially increase the contrast In an Image, 
making certain features stand out sharply. 

COSMIC - Computer Software Management and Information Center 
CPU - Central Processing Unit 
CRT - Cathode Ray Tube 
CT - Census Tract 

crs - Communications Technology Satellite (Canadian) 

CZCS - Coastal Zone Color Scanner 
D/A - Digital to Analog 

Data Set - A collection of similarly formatted records having 
like information (from one or multiple sensors). Identical 
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processing, and, typically, common tempora I /spat l a I /spectra I 
resol ution. 

Data Set Granule - A segment of a data set, such as a single 
Image, one or several records In some fixed time Increment, or 
flies on each of several tapes. 

Data System - A package combining data directory, caTalog, and/or 
Inventory functions, as well as data access, sometimes 
manipulation, and display capab 1 1 I t I es . 

DEM - Digital Elevation Model 

Digital Computer - A calculating device that operates on the 
principle of counting rather than measuring quantities that 
represent numbers (compare with "analog computer"). 

Digital Format - Data storage In machine-readable form. 

Digitizer - A device that converts analog Information InTo 
machine-readable form. 

Digitizing - The process whereby an analog value such as position 
on a map Is converted Into digital coordinates or values. 

DIME - Dual Independent Map Encoding 

Directory - A collection of high level Information about whole 
daTa sets (names, locations, sources of further Information, 
other Items TBD) . 

Discipline Issues - Those SD Issues which are associated wiTh 
spatial data characteristics (see Section 2.1). 

Disks - Large capactty, direct access storage devices; daTa is 
stored as magnetic areas on the face of one or several platters. 
These platters rotate rapidly and each bit of daTa Is available 
to the reading/writing "head" once per revolution, providing 
high-speed data access. Also spelled discs. 

DLG - Digital Line Graph 

DMA - Defense Mapping Agency (U.S. Department of Defense) 

DN - Digital Number 
DSN - Deep Space Network 

EDC - EROS Data Center (USGS, Sioux, Fal is, SD) 

EDP - Electronic Data Processing 


ELAS - Earth Resources Laboratory Application Software 
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Electromagnetic Radiation - Energy propagated through 
space or a material In the form of waves of Interacting 
electric and magnetic fields; for example, radio waves 
ana I Ight. 

Electromagnetic Spectrum - An ordered array of known 
electromagnetic radiations Including ultraviolet radiation, 
visible radiation. Infrared radiation, and microwaves (see Figure 
2 - 1 ) . 

EMK - Electromagnetic Radiation 

EntiTy Based Data Structure (EBDS) - A logical organization of 
spatial data in which geographic space Is modeled as a surface 
composed of spatial entities described by locanonal identifiers 
related to a coordinate origin. 

EOF - End of FI le 

EOT - End of Tape 

EPA - Environmental Protection Agency 

ERIM - Environmental Research Institute of Michigan 

ERL - Earth Resources Laboratory (NASA, NSTL Station, Ml) 

EROS - Earth Resources Observation Systems (USGS, Sioux Falls, 

SD) 

ERAS - Earth Resources Technology Satellite (now called Landsat) 
ESA - European Space Agency 

Experiment - A term used Interchangeably with investigation (the 
latter Is preferred). Activity or effort aimed at the generation 
of data obtained by measurement of space phenomena or the use of 
space to observe earth phenomena and the resulting analysis of 
such data. 

Experiment Data Records (EDRs) - Those records provided to tne 
Principal Investigator, Team Leader, Guest Investigator, Co- 
Invest i gator, or team member containing all the data from the . 
mission required to carry out the Investigation specified in the 
contracT or launch agreement. These records may Include orbital 
position, spacecraft attitude. Instrument attitude, commands, 
housekeeping data, ground time, spacecraft time, daTa from OTher 
Investigations and other Information as agreed upon. The exact 
form ot these records and manner In which these daTa are 
provided may vary depending upon the policies, procedures, and 
capabiliTies of the project, the payload or mission control 
centers, the data acquisition network, and any support 
processing facilities. These records shall be specified in the 
Project Data Management Plan. 
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Fac! I l+y-Class Payload Mission - A mission In which the payload 
Is an Instrument or set of Instruments which serve as a facility 
for a large group of Guest Investigators who may be selected at 
different times throughouT the life of the mission to 
participate. This type of mission may not have Principal 
Investigators or Team Leaders and all tne data collected from 
such a mission Is generally maintained by the project for use by 
Guest Investigators. Availability of data for tne scientific 
community at large shall be specified at the Project Data 
Management Plan. 

False Color Composite - An Image formed by assigning colors to 
two or more black and white Images of the same scene and 
combining them in a computer or on film to form a single Image. 
The resulting image pinpoints differences and similarities 
between the original Images. 

Field of View (FOV) - The solid angle through which an 
Instrument Is sensitive to radiation. 

FL - Focal Length 

Flux - The rate of flow of some physical quantity, usually 
energy . 

Footprint - The area on the surface being Investigated by a 
remote sensing device; it Is approximately given by the product 
of the beam width (In radians) times the altitude of the remote 
sensing platform. 

FORTRAN - A commonly used, high level algebraic language for 
coding computer algorithms (derived from FORmual TRANs I atlon) . 

FOV - Field of View 

FSK - Frequency Shift Key 

Gamma - Unit of magnetic field strength equal to 0.00001 gauss 
(Earth’s surface magnetic field strength Is about 0.5 gauss or 
50,000 gammas) . 

GBIS - Geo-Based Information System 

Geostationary Satellite - An equatorial satellite circling the 
Earth at such a high altitude (about 36,000 km) that Its period 
of revolution about the Earth matches the Earth’s rotational 
period; the satellite appears fixed In the sky to an observer on 
the surface, and the surface of the Earth appears fixed to the 
saTeliiTe. Also called geosynchronous. 

GIS - Geographic Information System 

GMT - Greenwich Mean Time 

GObS - Geostationary Operational Environmental Satellite 
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GRD - Ground Resolve Distance 

Ground Truth - Information obtained on site; frequently used to 
assist and corroborate Interpretation of remotely sensed data. 

Guest Investigator (Gl) - InvesTigator selected to conduct 
observations and obtain data within the capability of a NASA 
mission, which are additional to the mission's primary 
objectives, or for a facility-class payload mission. 

H/W - Hardware 

Hardware - The physical components of a computer. 

HDDT - High Density Digital Tape 

HDT - High Density Tape 

HDTR - High Density Tape Recorder 

Heat flux - The rate of flow of hear energy through an object. 

High Level Programming Language - A programming language (such a 
FORTRAN) which Is close to everyday, written language; a single 
line of code written in a high level language can represent 
dozens of machine instructions. 

HOM - Hotline Oblique Mercator 

Human Engineering - The design and Implementation of systems 
taking human psychological concerns into consideration. 

I/O - Input/Output 

IBIS - Image Based Information System 
IFOV - Instantaneous Field of View 
IG - Initial Gap 

Image - The representation of a scene as recorded by a remoTe 
sensing system (includes both machine-readable storage and photo 
products) . 

Image Classification - Analysis of digital image values, 
including spatial, temporal, and spectral band relationships, to 
obtain categories of pixels or specific features. 

Image Display Terminal - Displays color and/or black and white 
images on a television screen. There may be an aTtached 
function keyboard to control manipulation of the Image. 

Otherwise manipulation Is performed via the computer to 
which the terminal is connected 
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In-code Documentation - The level at which comments have been 
added to the program's source code. A we I l -documented program Is 
easier to modify or transport. 

Information - Data that have been processed for a particular use. 

Input - Information or data transferred or to be transferred from 
an external sTorage medium Into the inTernal sTorage of the 
computer . 

Inpur/OutpuT Handler - The software which controls access from 
the CPU to peripheral devices. Including storage units and 
display terminals. This software may be written for use by a 
single program, separate package of subroutines which are used by 
several programs, or a separate package supplied wlrn tne 
computer. 


InteracTive - A method of operation that allows Instantaneous 
communication between man and machine. 

Interoperability - Electronic Interconnection of NCE modules, 
with appropriate supporting software, allowing sequential and 
concurrent exercise of system functionality in a quasi- 
homogeneous and convenient way, and allowing automatic migration 
of information among modules. 

Interval - 1) The time between two events; 2) as a measurement 
scale, describes ranks and classes of data where the difference 
between tne ranks Is defined and constant, but may be arbitrary 
(e.g., Celsius and Fahrenheit temperature scales). 

Inventory - A collection of Information about the granules of a 
data set. (Note that inventories giving names, locations, and 
Independent variable ranges of data set granules may need to 
be distinguished from those inventories characterizing data 
set granules by their contained data - e.g., % cloud 
coverage . ) 

Investigation - Activity or effort aimed at tne generation of 
data obtained by measurement of space phenomena or the use of 
space to observe earth phenomena and the resulting analysis of 
such data. 

Investigator - A participant In an investigation. This term may 
refer to a Principal Investigator, Co-Investigator, Team Leader, 
team member. Guest Investigator, or any other member of an 
Investigation group. 

IR - Infrared 

IRG - Inter-record Gap 

JPL - Jet Propulsion Lab (NASA, Pasadena, CA) 
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JSC - Johnson Space Center (NASA, Houston, TX) 

KBPS - Kilobits Per Second 

L/C - Land Cover 

L/S - Landsat 

L/U - Land Use 


LACIE - Large Area Crop Inventory Experiment 

Land Use - Human-Imposed functions of a land area. 

Lanocover - The vegetation and artificial constructions covering 
a land surface. 


LANDSAT - Land Satellite (Formally Called ERTS) 

Landsat - A series of Earth-observing satellites that serve as 
platTorms for the RBV and MSS instruments and will serve as a 
platform for the Thematic Mapper. 

Language Percentages - The fraction of a program or pacKage which 
ts written In each of several languages. 

Level of Man-Machine Interface - The existence or extent of 
dialogue between the computer program and the user. The highest 
level of Interface consists of question/answer or menu-driven 
programs; the lowest level of Interface includes batch-submittal 
of jobs via cards or card Images. 

LFC - Large Format Camera 

LIDAK - Light Detection and Ranging 

LL - Line by Line 

LLA - Adjusted Line Length 


Locational Identifier - A means of assigning content daTa to 
spatial entitles. Includes external descriptive identifiers, 
such as county name, identifiers which assign content data to . 
location within an array of discrete units (e.g., pixels), and 
coordinate Identifiers related to an origin. 

Low Level Programming Language - A programming language very 
close to the actual compuTer Instruction set; commonly called an 
assembly language (compare with "High Level Programming 
Language"). 

LP - Line Printer 


LPM 


Load Point Marker 
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Machine Language - The computer Instruction set (usually 
represented just by numbers; If characters and numbers are used, 
this becomes assembly language). 

Magnetometer - A device for measuring Magnetic fields. 

MAGSAT - Magnetic Field Satellite (U.S.) 

Manual Classification - The Identification of features on aerial 
or satellite photographs by tone, color, texture, pattern, shape, 
and size. 

Map - Usually a two dimensional representation of all or part of 
the Earth's surface, showing selected natural or man-made 
features or data, preferably constructed on a detmlTe projection 
with a specified scale; Includes digital maps, and other special 
maps . 

Map Projection - See Projection. 

MAPSAT - Mapping Satellite (U.S.) 

Maximum Likelihood Rule - A statistical decision criterion to 
assist In the classification of overlapping signatures; 
overlapping units are assigned to the class of highest 
probab 1 1 I ty . 

MCR - Monitor Console Routine 

Measurement Scale - A system for quantifying observations 
according to predetermined rules, which define four successfully 
greater levels of data precision (nominal, ordinal. Interval, and 
ratio); see Section 2.2.7. 

Medium - Physical device on which data are sTored, such as a 
photographic product, map, table, card, or magnetic tape. 

Memory - An organization of data STorage units In the computer; 
also the amount of main core storage required for a computer 
tasK, provided by direct access via the central processing unlT. 

Mission - One or more flights within an approved payload project. 

Mission Scientist - A scientist from a NASA field center assigned 
to a Spacelab mission, the Mission Scientist has similar 
funcnons as the Project Scientist with the exception of direct 
responsibility for the development of any experiments. 

MLA - Multi-Linear Array 

MPP - Massively Parallel Processor - GSPC 

MSS - Mu I tl spectra l Scanner 

MTU - Magnetic Tape Unit 
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Mu I ti spectra 1 (line) Scanner (MSS) - A remote sensing device 
capable of recording data in the visible and Infrared portions of 
the spectrum. 

Nadir - The point on the ground vertically beneath the center of 
the remote sensing system; also called the sub-satellite point. 

NASA - National Aeronautics and Space Adra I n I sTrat I on 

NASA Catalog Environment (NCE) - The aggregation of the data 
Information modules (directories, catalogs. Inventories) of 
NSSDC, project- and discipline-specific systems, and of 
Individual scientists; the interfaces to, and links among, these 
modules; the use and management of these modules; etc. (Note 
that NCE is viewed more as a coordinated grouping of systems 
rather than as a single supersystem. 

NASCOM - NASA Communications Network 

National Space Science Data Center (NSSDC) - The main central 
repository for selected data and documentation from space science 
flight missions that serves as a disseminator of this archived 
data and supporting Information to users throughout the 
international scientific community. The NSSDC, located at 
Goddard Space Flight Center, serves as a switching center tor 
requesters who desire data still held Individually by Principal 
Investigators (Pis) or Team Leaders (TLs) by providing a 
description of the spacecraft and experiment and the name, 
address and telephone number of the P! or TL. For missions 
Involving a Guest Investigator program in association with a PI 
or TL experiment or Involving a facility-class payload the role 
of the NSSDC shall be specified In the Project Data Management 
Plan. 

NCIC - National Cartographic Information Center (USGS) 

Nimbus - A series of Earth-observing experimental weather 

sate I i i Tes carrying a variety of sensors; the lasT of the series 

(Nlmbus-7) was launched late In 1978. 

NMD - National Mapping Division (USGS) 

NOAA - National Oceanic and Atmospheric Administration 

Nominal - As a measurement scale, distinguishes things only on 
the basis of their Intrinsic character (difference In kind, e.g., 
apples, oranges) . 

NOS - National Ocean Survey 

NRZ - Non-Return to Zero 

NSF - National Science Foundation 
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NSTL - National Space Technology Laboratory (Bay St. Louis, Ml) 

NT I s - National Technical Information Service 
OCS - Ocean Color Scanner 

Off-line - Peripheral equipment that Is not under the control of 
the CPU, as contrasted to on-l Ine. 

Off-nadlr Viewing - Intercepting electromagnetic radiation along 
a direction other than the vertical (nadir) between the sensor 
ana tne object being sensed (normally the Earth). 

OMB - Office of Management and Budget 

On-line - Under direct control of the Central Processing Unit. 

Operating System - The high-level administrative program running 
In a computer at all times which controls the overall operation 
of the computer and Its tasks. 

Ordinal - As a measurement scale, distinguishes things on tne 
basts of rank by some quantitative measure (e.g., small, medium, 

1 arge) . 

OSTA - Office of Space and Terrestrial Applications (NASA) 

Output - Information, data, or other results of a computer 
operation which are recorded on some external storage device 
(tape, disk, printer, etc.). 

Passive System - In remote sensing, a system that responds to 
electromagnetic radiation which Is either emitted by the object 
or coming from a natural source, such as the sun, and Is 
reflected or scattered by the object (compare with Active 
System) . 

Peripheral Device - A device connected to a computer to provide 
communication (e.g., alphanumeric terminal, printer, plotter) or 
aux I I i ary f unct I ons. 

PI - Photo Interpretation 

PI - Principal Investigator 

Pixel - "Picture element": The smallest unit of surface 

reflectance measured by a sensor system. 

Platform - In remote sensing, the physical object (e.g., balloon, 
rocket, or satellite) that carries the Instrument (sensor) that 
makes the remote measurement. 

Plotter - A printer used to record non-al phanumer ic Information, 
such as graphs or images, on paper. 
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PMT - Photomultiplier Tube 

Precision - The degrees of exactness with which a quantity Is 
stated; contrast with "accuracy," which refers to the absence of 
error, regardless of precision. 

Preprocessing - Manipulation of raw data tor STandard I zar l on 
prior to further analysis. 

Principal Investigator (PI) - A person who conceives an 
Investigation and Is responsible for carrying It out, reporting 
Its results, and providing appropriately selected data and 
supporting documentation to the scientific community In 
accordance with the Project Data Management Plan. The PI 
chooses hts Co-Investigators and asslgnes them roles and 
privileges. The PI Is the primary point of contact wltn 
the project office regarding the Investigation. 

Printer - An output device used to record the output (usually 
alphanumeric) on paper. 

Program Scientist - A NASA Headquarters official assigned to each 
mission who has a number of roles and responsibilities defined In 
NASA Management Instruction 7100.11, Attachment D. The most 
relevant one for this subpart Is the responsibility to establish 
the data analysts, data dissemination, and data archiving 
policies for the mission, which will be documented In the Project 
Data Management Plan. 

Project Data Management Plan (PDMP) - A plan tnat addresses the 
total activity associated with the data acquired by a mission 
from the delivery of the Experiment Data Records to tne 
Investigators to the delivery of selected reduced and analyzed 
records along wltn supporting documentation to a specified 
repository. The plan should provide the milestones in the data 
reduction, data Interpretation, and resource requirements for 
these phases. Any planned data Interpretation meetings, 
workshops, or other activities should be Identified. The type of 
data records, data products and compilations that have been 
selected in concert by the Investigators, the Project Scientist, 
the NSSDC acquisition manager, and any appropriate scientific 
advisory personnel for general availability to the I nTernati onal 
scientific community and for delivery to a disseminating 
repository, such as the NSSDC, shall be specified. For missions 
where the data will be maintained for many years by the project, 
the Principal Investigator handling a Guest Investigator 
program, or by an Institute established by the mission, the 
eventual transfer of appropriate data to a more permanent 
archive, such as NSSDC or other repository, shall be specified. 
Conditions for discarding or destroying the Experiment Data 
Records shall be specified. 

Project Scientist - A scientist from a NASA field center assigned 
to a project to manage the scientific aspects. The roles and 
responstbl I iTies of this function are given In NASA Management 
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Instruction 7100.11, Attachment E. 

Projection - A systematic construction of features on a plane 
surface to represent corresponding features on a spherical 
surTace. These features include observable phenomena (e.g., 
physical and cultural features such as coastlines and highways) 
as well as constructs (e.g., lines to represent parallels and 
meridians, political boundaries, or statistical units). 

Proximal Sensing - Obtaining Information by making direct, 
physical contact with It (compare to "remote sensing"); also 
making direct or In-situ measurements. 

PS - Polar Stereographic (Map Projection) 

PSK - Phase Shift Keying 

QA - Quality Assurance 

R/S - Remote Sensing 

R&D - Research and Development 

Radiance - Flux of radiant (electromagnetic) energy measured In 
power units (e.g., watts); frequently confused with radiancy 
(flux densiTy per solid angle, commonly measured in units of 
watts cm-2 sterad-1 ) . 

Radiometer - A passive device for Intercepting and quantitatively 
measuring electromagnetic radiation In some bank of wavelength. 

Radiometric Character f st I cs - Characteristics of a radiometer 
that help to define Its resolution. 

Ratio - A measurement scale; distinguishes tnings on the basis of 
magni-Image line by line. 

Ratio - As a measurement scale, distinguishes things on the basis 
of magnitudes that are Intrinsically meaningful by use of a non- 
arbitrary zero point (e.g., age or Kelvin temperature scale). 

Rawinsonde - An instrument balloon carrying temperature and 
humidity probes tn addition to a radio which transmits tne 
Information to an Earth station. 

RBV - Return Beam Vtdicon 

Rectification - The process of projecting an oblique or tilted 
image onto a horizontal plane to produce the equivalent of an 
until ted image. 

Reduced Data Records (RDRs) - Those records prepared from the 
Experiment Data Records by applying corrections, where 
applicable, for temperature, voltage, gain change, oftseTs, dead 
time, drift and other known Instrument changes, as well as 
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eliminating unusable noisy periods and periods of questionable 
Instrument performance. The Reduced Data Record should contain 
all the basic and supporting measurements obtained from the 
experiment, such as time, position, attlTude, settings of 
Instrument by command, housekeeping data and other Information 
neeaed to analyze the data in an independent fashion. Visual 
data, such as photographs derived from Imaging processing 
techniques, may also be considered as RDRs. 

Registration - Superposition of points on one Image with 
corresponding points on a second image or map of the same scene. 

Remote Sensing - Obtaining Information about an object or 
phenomenon without direct contact. (Compare with Proximal 
Sens I ng . ) 

Resampling - The use of mathematical or geometrical methods to 
estimate values on one cell based structure from values 
originally given on another structure; may Include Interpolation 
and extrapolation. 

Resolution - The minimum area of surface that can be Imaged by a 
remote sensor system. 

Return Beam Vfdicon (RBV) - A modified vldlcon television camera 
tube that produces high resolution images. On Landsat 1 and 2 
there were three cameras operating In the green, red, and 1R 
spectral regions to form the RBV system; Landsat 3 has two black 
and white cameras each covering halt the swath width. These 
Images are not used as frequently as the MSS Images. 

RF - Radio Frequency 

RJE - Remote Job Entry 

RT - Real Time 

S/C - Spacecraft 

S/W - Software 

SAB - Space Applications Board (National Research Council) 

SAR - Synthetic Aperture Radar 

Scalar - A quantity with a numeric value. 

Scale - The ratio of distance on a map, chart, or photographic 
image to the equivalent distance on the Earth's surface. 

Scaling - Transforming a display by multiplying all dimensions by 
a constant value, magnifying or reducing features. 

Scientific and Technical Information Facility (STIF) - NASA's 
document and report acquisition and abstracting facility that 
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produces a biweekly abstract journal, STAR, covering the 
aerospace report literature and a biweekly abstract journal, 

IAA, covering the published literature In these fields. The 
faclliTy also produces microfiche copies of the report 
literature for primary distribution. 

SEASAT - Sea Satellite (U.S.) 

Seasat - An Earth-observing saTelllte designed to gaTher 
Information about the oceans; launched In 1978, died abruptly In 
1 9/8. 

Sensor - A device that gathers electromagnetic radiation and 
presents It In a form suitable for obtaining Information about 
the env l ronment . 

Signature - A set of spectral, tonal, or spaTlal character I st l cs 
which together serve to Identify a class or feature by remote 
sens i ng . 

SLAk - Side Looking Airborne Radar 
SMS - Synchronous Meteorological Satellite 
SM;>a - Standard Metropolitan Statistical Area 
SNR - S I gna I -to-NoI se Ratio (also S/N) 

Software - A computer program as written in a high or low level 
language; may Include the documentation explaining the program. 

Software Organization - The manner In which the computer program 
Is arranged in memory. The simplest organization Is one task or 
several tasks which execute serially. The more soph i stI caTed 
arrangements Include tasks which are linked through the operating 
system or through a unique and self-contained linking program, 
and a single task decomposed to segments, which communicate with 
each other but execute Independently one at a time (overlays). 

Software Transportability - A rating given to the anticipated 
ease with which a program could be moved to another computer. 

This rating depends on the software's modularity. Its man-machine 
interfaces, its size and organization, and language In which it 
is written, the use of I/O handlers, and the documentation 
aval I ab I e . 

Space Science Flight Investigations - Investigations of natural 
phenomena of the earth and its environment, the moon, other 
planets, the sun. Interplanetary space, and other celestial 
object and regions made from aircraft, balloons, sounding 
rockets, satellites, probes, and manned spacecraft for the 
purpose of Increasing basic knowledge of these natural phenomena. 
Biological Investigations Involving the search for 
extraterrestrial life are included. 
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Spatial Data - Phenomena with Implicit or explicit locational 
identifiers; Includes spatial data content and spaTial daTa 
entitles. 

Spatial Data Content - Direct or Indirect measurements tnat are 
spatially distributed, which means that their values vary from 
one location to another, and data which relaTe to a single 
location, where variation in value occurs through time. 

Spatial Data Entitles - Points, lines, areas, or surfaces wltn 
which spatial data content are associated (e.g., weather 
stations, rivers, watersheds, and elevation models). 

Spatial Data Integration (SDI) - The process of combining 
multiple spatial data sets and providing for their sTorage, 
retrieval, analysis, and display. 

Spatial Data Structures - The logical organization of spanal 
data entitles and spatial data content. 

Spectrometer - A radiometer with a dispersive element (prism, 
grating, or circular Interference filter) so that the Incident 
radiation Is measured as a function of wavelength. 

SPOT - Satellite Probatolre d Observation de la Terre (France) 

Standard Metropolitan Statistical Area (SMSA) - A county or group 
of contiguous counties that contain at least one city of 50,000 
Innabitants or more, or "twin cities" with a combined 
population of at least 50,000. In addition to the county or 
counties containing such a city or cities, contiguous counties 
are included Is an SMSA If, according to certain criteria, they 
are socially or economically Integrated with the central city. 

Supervised Classification - Classification of Images using 
statistics developed from training sites. 

Swath Width - The area on either side of a platform which is 
surveyed by a remote sensing instrument; composed of many 
(overlapping) footprints. 

Tape - A long, thin, flexible medium for storing data; usually 
refers to digital magnetic tape on which data are stored as a 
string of ones and zeros by altering the magnetic domains of an 
Iron oxide coating on the tape; less frequently refers to analog 
magnetic tape or occasionally to paper tape on which data are 
stored by punching holes through the paper. 

Tape Drive - A device used for storing data on magnetic tape. 

Tape drives may record up to none tracks of data along the width 
of a standard computer tape. 

TBD - To Be Determined 

TDRSS - Tracking and Data Relay Satellite System 
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Thematic Mapper - A high speed, high resolution mu 1 1 1 spectra I 
scanner for monitoring Earth resources? planned for Landsat-D. 

TIRS - Thermal Infrared Scanner 

TLM - Telemetry 

TM - Thematic Mapper 

Training - Informing the compuTer system which site to analyze 
for spectral properties or signatures; also called signature 
extracT i on . 

Training Sites - Recognizable areas on an Image with distinct 
(spectral) properties useful for Identifying other similar areas. 

U-2 - High altitude (remote sensing) aircraft 

UHF - Ultra High Frequency 

UNESCO - United Nations Educational, Scientific and Cultural 
Organ I zat Ion 

Unsupervised Classification - Computer classification of digital 
Images by placing similar pixels Into categories without the aid 
of training-site data. 

USLE - Universal Soil Loss Equation 

UTM - Universal Transverse Mercator (Grid) 

UV - Ultraviolet 

Vector - A quantity possessing both numerical (scalar) value and 
direction. 

VHF - Very High Frequency 

VHRR - Very High Resolution Radiometer 

VICAR - Video Information Communication and Retrieval (JPL 
software) 

WBTR - WIde-Band Tape Recorder 
WEFAX - Weather Facsimile 
ZTS - Zoom Transfer Scope 
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SUMMARY 


John E. Es+es and Jeffrey L. Star 

This document represents a progress report of work conducted 
under grant NASA NAGW-455 during the period May 1, 1986 to 
January, 198/. This document also briefly describes tne 
directions we propose to undertake In the coming year of this 
fundamental and appl led research efrorT. 

Tfoe Information System Research Group research continues to 
focus on improving the type, quantity, and quality of Information 
which can be derived from remotely sensed data. As we move into 
the coming year of our research, we wll I continue to focus on 
Information science research Issues related to the Earth 
Observing System (EOS) of the Space Station Complex. The 
community of EOS users In the area of Earth Systems Science and 
applications represents scientists and researchers from many 
disciplines working on a variety of hardware and software systems 
at a number of geographical ly distributed locaTions across tne 
United States and around the globe. Past research conducted 
under this grant nas been used to extend and expand exisTlng 
remote sensing research activities at UCSB In the areas of 
georeterenced Information systems, machine assisted Information 
extraction from Image data, artificial Intelligence, and 
vegetation analysis and modeling. 

The program of research, documented In this progress report. 
Is being carried forward by personnel of the University of 
California, Santa Barbara. Through this work, we have targeted 
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fundamental research aimed at Improving our basic understanding 
of tne i ole of Information systems technologies and artificial 
Intelligence techniques In the Integration, manipulation and 
analysis of remotely sensed data tor global scale STUdies. This 
coordinated research program Is possible at UCSB due to a unique 
combination of researchers with experience In a I I these areas. 

Several of our projects have used this grant as a catalyst 
to aid other NASA offices In the research. In the integration of 
remotely sensed and other data Into an Information sciences 
framework. During this year we have received additional funds 
from NASA Code El to supplement ISR6 activities. In addition, we 
have conauctea research for the United States Geological Survey 
on related matters and have received funds from NASA Code E to 
Iook questions revolving around Browse of large spatial daTa sets 
I n the EOS Era. 

Grant actlvlTles continue to generate papers in reviewed 
journals and to support graduate student Master's theses and 
docTorai dissertation research. We feel these activities are 
very Important and we wi I I continue to support student 
researchers and to submit material for publication in reviewed 
journals and national and International symposia. In addition. 
Grant researchers have been, and continue to be, asxed to speak 
on EOS and Space Station related Information system research 
across The country and abroad. 

It Is our belief that The work conducted under tnls Grant Is 


University of California, Santa Barbara 


Page 2 


SUMMARY 


significant. We are making progress as we prepare to take 
advantage of the Tremendous potential ofrered by the Space 
Station Complex In general and the EOS system In particular. 
Research such as this ana that being supported by NASA aT omer 
universities and NASA centers which make multldlsclpl Inary Earth 
System Science a real Ity and further the appl icaTions of the 
earth observing satel I Ite remote sensor systems of the 1990's. 
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Preliminary Design of a Far* Monitoring 
Geographic Inforaatfon System 


David M. Stoas 



Prei imlnary Design of a Farm Monitoring 
Geographic Information System 


The fol lowing chapter is a portion of the M.A. Thesis of Mr. 
David Stoms. This work was funded In part by the current year's 
funding, and represents a proposed design for an information 
system. This geographic Information system will Integrate 
remotely sensed Imagery, field observations, and tne local 
farmer's detailed knowledge, as well as concepts from the field 
of artificial Intelligence. 

Mr. Stoms continues In his graduate work In our laboratory 
at this time, pursuing the Ph.D. degree. He plans to continue 
his efforts on the integration of geographic Information systems, 
remote sensing technology, and artificial intelligence. 
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ABSTRACT 


Preliminary Design of 
a Farm Monitoring 
Geographic Information System 

by 

David Michael Stoms 

The objective of this thesis is to improve our understanding of the need for and characteris- 
tics of an image processing/geographic information system (GIS) for monitoring crops on 
individual farms. A GIS is proposed that integrates remotely sensed imagery, field sensor 
data, and farmers’ intimate knowledge of their operations to provide frequent, rapid infor- 
mation. Expert systems would perform some of the image analysis tasks, identify deviations 
from desired conditions, diagnose probable causes of stress, and recommend appropriate 
management responses. 

For effective monitoring, remotely sensed inputs must be frequent, timely, of high spa- 
tial resolution, and with spectral coverage in the visible, near IR, mid IR, and thermal IR 
regions. Four satellite sensor systems (Landsat TM, SPOT, AGSAT, and the Earth Observ- 
ing System) are identified as candidates to provide multispectral data. None, however, 
meets all of the specified requirements. Recommendations are made for modifications to 
each sensor system to make them suitable for farm monitoring applications. 

This research outlines how, by putting remotely sensed data in an information system 
context, it can be made more useful for end-users not trained in image processing and 
interpretation. Development of this capability would be an important step in the dissemi- 
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nation of remote sensing and GIS technology to a broader range of users. 
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1. INTRODUCTION 

The objective of this thesis is to improve our understanding of the need for and 
characteristics of an image processing/geographic information system (GIS) for monitoring 
crops on individual farms. A GIS is proposed that integrates remotely sensed imagery, field 
sensor data, and farmers’ intimate knowledge of their operations to provide frequent, rapid 
information. Monitoring, as used in this thesis, refers to quantitative and qualitative assess- 
ment of general crop conditions, crop stage and biomass, plant stress, and expected yield. 
Expert systems would perform some of the image analysis tasks, identify deviations from 
desired conditions, diagnose probable causes of stress, and recommend appropriate manage- 
ment responses. This research outlines how, by putting remotely sensed data in an informa- 
tion system context, it can be made more useful for end-users not trained in image process- 
ing and interpretation. Development of this capability would be an important step in the 
dissemination of remote sensing and GIS technology to a broader range of users. 

In the early days of digital civilian remote sensing, it was predicted that farmers 
would be major beneficiaries of this new technology (National Research Council, 1970). 
Data would be available to assist in making farm management decisions concerning such 
activities as irrigation, fertilization, and pest control. Since then, literally hundreds of arti- 
cles have been published about large area applications of remote sensing in agriculture 
(summarized in Bauer, 1985; McDonald, 1984). However, remote sensing technology has not 
yet been applied to the daily management of individual farms. In fact, a recent article 
which details the history of agricultural remote sensing does not even mention farmers 
(McDonald, 1984). 

A number of recent trends in remote sensing hold promise for making commercial 
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farm monitoring applications more feasible (Brammer, 1984). Among these advances are 
the development of: 

-powerful, low-cost personal computers and display devices 
-geographic information systems 

-expert systems in remote sensing, image processing, and crop modeling 
-sensors with improved spatial and spectral resolutions, and 
-advanced data communications technology 

Taken in combination, these trends create the potential for a system where individual farm 
managers can do their own automated image analysis on site, without need for centralized 
data processing and analysis. Further supporting this trend is the current emphasis by the 
Federal government on the commercialization of space, and a renewed interest within 
NASA in applications oriented research. In a recent proposal submitted to NASA for a 
Center for the Commercial Development of Space, commercial real-time farm monitoring 
was one of the primary research areas {The Ohio State University, 1986), 

To date, there are no operational remote sensing systems that can meet farmers’ 
requirements for frequent, real-time crop information. There certainly are no systems avail- 
able that are simple to use by untrained interpreters. In fact, the features comprising a 
farm monitoring remote sensing system have not been clearly defined (Jackson, 1984). 

A geographic information system is typically defined as an information system con- 
taining data referenced by spatial location (Estes, 1986). Inputs are usually in map or 
image form, but may also be attributes in tabular form. The first operating system was the 
Canada Geographic Information System (CGIS), implemented in 1964. It’s original purpose 
was to locate marginal agricultural lands (Tomlinson, 1982). Since that time, dozens of sys- 
tems have been developed, but only about ten are commercially available (Estes, 1986). 


The commercial systems tend to be multipurpose and are not custom-tailored to farm moni- 
toring. 

Two aspects of a farm management information system are addressed in this thesis: 

1) the elements of a farm monitoring image processing/geographic information 
system, including the role of remote sensing 

2) the capabilities and limitations, within this GIS context, of four candidate 
satellite sensor systems for providing timely crop information 

These two aspects are discussed in Chapters 5 and 6, respectively. The development of a 
prototype GIS, complete with the appropriate agriculture-oriented knowledge-based sys- 
tems, is beyond the scope of this masters thesis research. However, several examples of 
monitoring cotton fields are described in Chapter 7. 

To lay the foundation for the analysis, the literature is summarized regarding farmers’ 
need for current crop information (Chapter 2), previous research into agricultural remote 
sensing (Chapter 3), and four candidate remote sensing systems (Chapter 4). The four satel- 
lite systems considered in this thesis are: 

1) TM — the Landsat Thematic Mapper 

2) SPOT — the French-built Systeme Probatoire d’Observation de la Terre 

3) AGSAT — a dedicated agricultural system proposed by a group of commercial 
vendors and professors from Stanford University, and 

4) EOS — the National Aeronautics and Space Administration’s (NASA) Earth 
Observing System consisting of several instruments proposed for a Polar Plat- 
form on the Space Station complex 

These four were selected because they are representative of existing and proposed systems 
that may be suitable for providing farm monitoring data. 

It is assumed in this thesis that a farm monitoring system has practical commercial 
application as a management decision tool. Farmer interest in and cost-effectiveness of such 


a system is not evaluated. Furthermore, it is assumed that most farmers are not trained in 
image processing and analysis, which implies that image processing tasks need to be 
automated. I also assume that the remote sensor systems being evaluated will operate as 
they currently exist or as they are proposed. This assumption covers not only the sensor 
package, satellite platform, and spatial, spectral and temporal resolutions, but the ground 
processing facilities as well. However, recommendations will be made in Chapter 8 on 
modifications to the sensor systems to make them suitable for farm monitoring applications. 
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Support for International Science 


This appendix Is the text from a proposal to NASA 
Headquarters, which was submitted during tnis contract year. The 
purpose of this proposal Is to examine the opportunities for 
col laboration and training In the International community. As a 
focus, we propose to work with the United Nations Environment 
Program offices In Nairobi, Kenya and In Geneva, Switzerland, 
with collaborators at NASA’s Earth Resources Laboratory at NSTL. 
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Remote Sensing Research Unit 
DeparnnenT of Geography 
University of California 
Santa Barbara, California 93106 


24 November 1986 


Mr. Alexander Tuyahov 
Code EED 

NA5A Headquarters 
Washington, D.C. 20546 


Dear Mr. Tuyahov: 


Please consider this document an unsolicited- letter proposal 
from the University of California, Santa Barbara, for work In 
support or international Information science applications of 
satellite remote sensing technology. The Information Science 
Research Group (ISRtf) at the University of California, Santa 
Barbara (UCSB) Is currently conducting advanced research In the 
Information sciences area tor the National Aeronautics ana Space 
Administration under two Grants (NAGW-455 and NAGW-987). Under 
these grants UCSB researchers are examining a variety of Image 
processing database, networking, and data systems Issues which 
wii i impact the earth sciences in the Space Station era. 


A major focus of this research Is facilitating access to a 
wide yarlety of regional, continental, and global scale Earth 
Science related data sets. This focus permits us to prepare for 
the analysis of Imagery and other data from the Earth Observation 
System planned for the polar platform component of the Space 
Station Complex. While considerable attention is currently being 
placed upon access to U.S. national datasets by a number of NASA 
centers ana NASA funded university research teams, ft Is our 
feeling that more attention to establishing ties to International 
data depositories Is needed. As such this document represents an 
unsolicited proposal to examine the problems and potentials 
Inherent In linking U.S. Institutions to International 
organizations which process, maintain, and disseminate 
International and global Earth Science data sets. 


Specif leal ly, we at UCSB propose to work jointly with 
personnel of NASA’s Earth Resources Laboratory, as well as tne 
United Nations Environment Programs Global Resource Information 
(UNEP/GRID) program In both Nairobi, Kenya and Geneva, 
Switzerland to examine the technical Issues and problems of 
hardware, software, data standards, and processing problems 
Inherent In tying U.S. and International foreign data 
depositories into a functional system which can facilitate earth 


24 September 1 98b 


Pag 



UNIVERSITt OF CALIFORNIA, SANTA BARBARA 




scientists at UCSB and other NASA affi I fated research 
I nsti tut ions, to better access the daTa resident with in tnese 

organizations. 


UCSB researchers already have experience working with 
UNEP/GRID personnel. Under existing NASA grants at UCSB we have 
vlsltea and obtained briefings on the operational status and 
future plans of the UNEP/GRID program In Geneva, Switzerland. 

UCSB personnel are also funded under NASA Grant NAGw-987 to visit 

Nairobi where briefings on UNEP/GRID will be held. In our • 

contacts with Geneva UNEP/GRID personnel, they have discussed an 
Interest In cooperating with ISRG/UCSB personnel In this effort. 


One specific Issue we would like to address is a need to 
augment the Image processing and database management and 0 

man i pu I at i on capabi I I ties at UNEK/GRIO Nai rob I. Th is w 1 1 l a I i ow 
efficient manipulation and and transfer of data from Africa to 
UCSB. We be I leve that this wl I l be necessary before meaningful 
analysis of the problems with and potential for International 
IfnKs with U.S. research organizations can be fully realized. In 

this proposed effort UCSB personnel propose to coordinate with 9 

NAsa National Space Technology Laboratories (NSIL) personnel to 

conduct a preliminary analysis of the prospects and problems 

Involved In linking UNEP/GRID Nairobi and Geneva Into U.S. NASA 

related Earth Science research Institutions. This International 

scale ot research. Involving elements of data set access, data 

standards, networking protocols, and so forth provide a good f 

model for the information systems now being developed for tne EOS 

era. 


The Global Resource Information Database (GRID) Is a new 
Initiative of the United Nations Environment Program (UNEP) and 
an element of the Global Environmental Monitoring System (GEMS). 
Currently finishing Its pilot phase, GRID grew out of a need to 
coordinate within a common geographic reference systems, the many 
environmental datasets that have been and continue to be 
col lected by GEMS other UNEP programs and United Nations (UN) 
specialized agenctes. GRID Is being designed and implemented to 
facilitate access to and analysis of these data sets for 
environmental and resource assessments. As currently structured, 
GRiu has two computer processing centers under UNEP direction: 
GRID-control at UNEP Headquarters In Nairobi, Kenya; and GRID- 
processor In Geneva, Switzerland. At present, the NASA Earth 
Resources Laboratory In Mississippi Is the principal NASA center 
providing technical support and engineering expertise to GRID. 


GRID-processor In Geneva is located In a University of 
Geneva Tael I ity provided to UNEP at no cost. It was designed and 
constructed specifically for GRID and the operation and 
maintenance of the facility Is contributed by the Center of 
Geneva. The facility Is well equipped with Prime, Perkin Elmer, 
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and ERDAS based data systems and Image analysis processing 
capab 1 1 iti es. GRID-control , Nairobi has been established within 
GEMS offices In UNEP's Headquarters at the UN, Glglrl Complex. 
GRiu-control functions within the admlnlsTratl ve context of 
Earthwatch . 


As previously stated, GRID Is currently in Its pIiot phase. 
The pilot phase of the GRID program has four primary objectives. 
These objectives are: 

1) to develop geographic Information system (GIS) 
methodologies and procedures for constructing and 
manipulating global environmental daTa sets tor tne purpose 
of conducting environmental assessments. (The continent of 
Africa has been selected as the case STudy for this 
objective); 

2) to demonstrate That GIS technology as applied within 
GRID Is an effective tool which combines global and national 
data sets for resource management and planning applicaTlons 
at the national level. (The candidate countries for these 
demonstrations are China, Ethiopia, Indonesia, Kenya, 

Panama, Peru, Senegal, Sudan, Thailand and Uganda); 

3) to establish the framework for cooperation and 
data exchange within International and 
Intergovernmental organizations which deal wiTh 
environment-related matters, such as FAO, WHO, WMO, 

ICSU, ILCA, IUCN, etc.; and. 


4) to provide training opportunities in GIS and resource 
data management technologies employed by GRID to the 
scientists and resource managers from particIpaTing 
developing countries. 


The pilot phase of GRID originated as a cooperative project 
between the GEMS Programme Activity Centre of UNEP and the 
National Aeronautics and Space Administration (NASA). It grew 
out of common interest of the two organizations. UNEP Is 
mandated to monitor and assess the state of the global 
environment on a continuing basis; NASA is one of tne principle 
organizations Involved in studying the long-term changes In 
processes that affect life on The EarTh. From this beginning 
other nations and organizations have been brought Into the 
venture so That GRID has become a truly I nTernat l ona I cooperative 
effort. It Is our feeling that It Is now time to Include the 
NASA related university community In this etforT. The research 
proposed herein represents a preliminary step In this direction. 


UCSB has been involved In a number of etrorTs over tne pasT 
years which directly Impact the requirements and opportunities 
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for International dataset production and exploitation. In May 
198.>, NASA’s Office of University Affairs signed a grant 
establishing a Remote Sensing Information Sciences Research Group 
at UCSB. Research conducted under this ongoing grant has been 
used to extent and expand UCSB’s efforts In the areas of 
georeterenced Information systems, mach I ne-assISTed extraction of 
Information from Image data, artificial Intelligence, and 
modeling. The group's philosophical emphasis Is to Improve tne 
type, quantity, and quality of Information which can be derived 
from remotely sensed data. 


Two examples of our efforts under tnis grant nave direct 
relevance to the problems of global dataset creation and 
dissemination. In April 1985, we completed a brief STUdy which 
considered the app I Icatlons of artificial Intel I Igence techniques 
to large distributee networks. We Identified a number of areas 
where such technology could provide more efficient problem 
solving, communications, and data base creation and sharing than 
conventional techniques (Dubayah, Smith, and Star, 1985). 


Another example of advanced technical applications Involves 
a fundamentally new geographic information system, developed 
unoer shared funding from NASA, United States Geologic Survey, 
and Digital Equipment Corp. This new system, KBGIS II, 
Incorporates new developments in data structures, software 
engineering, and artificial intelligence. The system provides a 
user-corn Igurab I e applications interface, one-step search for 
complex objects, and state-of-the-art capabilities for 
intelligent search of exceedingly large datasets. Such a system 
would be a natural environment for the manipulation and analysis 
of global -scale data. 


UCSB has Just received funding from NASA Headquarters for a 
new effort which also has direct relevance to the development and 
use of global datasets. This new grant (NASA NAGW-987, "Browse 
In the EOS Era") permits to work on the key Issues of browse of 
remotely sensed and other ancl I l ary data. Earth science and 
relevant technology have developed to a point where a number of 
agencies. Including UNEP, are actively considering the 
development of large area datasets. These large, heterogeneous 
spatial data Holdings require hew Insights in daTabase management 
and processing, to be able to serve a wide range of 
geographically distributed users. Our work under tnis new grant 
will Include developing a set of functions that permit a user to 
examine remote uatabases, to be able to make an Intel I igent 
decision on when these databases contain useful material for a 
variety of different processing and analysis functions. 


There Is an International need to Improve access to global 
data sets tor NASA and NASA-related researchers both in tne 
United States and abroad. The UNEP/GRID activity Is a source of 
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such da+a, of growing significance to both research and 
operational agencies. It Is Important That we begin to 
facilitate Interactions to assess the problems and prospects for 
connecting Into this Important source of global daTasets. The 
material which fol lows presents our methodology and research 
requirements tor beginning this Important task. 


METHODOLOGY 


1 Analysis of existing systems In Geneva and Nairobi. 

In our contacts with staff at GRID'S Geneva facility and 
NSIL personnel, under NASA Grant NAGW-455, we have examined and 
discussed the primary functions required at UNEP/GRID Nairobi. 

• We win determine whether sufriclent nardware is available In 
Nairobi, and advise them on necessary staff development. 

2 Determination of requirements tor system augmentation aT 
Geneva, Nairobi, and Santa Barbara. 

# One of the principal constraints of new systems aT These 
location Is compatibility with existing hardware and software at 
principal NASA and UNEP/GRID facilities. We will continue our 
analysts Jointly with staff at NSTL, NASA Headquarters, and 
UNLP/GRlD Geneva. 


• 3 Subject to funding, purchase required hardware and software. 

4 Install and Integrate new capabilities 

Our col iaborators at NSTL have detailed experience In the 
specific hardware and software environment Instal led at GRID 

• Geneva. Their specific participation in this proposed ettort 
will include acceptance tests of the equipment at the shipping 
location. NSTL has also agreed to ship the equipment to Nairobi, 
as wel 1 as provide staff time and travel for Instal latlon and 
maintenance during the contract year. 

f 5 Conduct pilot tests of data transfer to document the 

capabilities of the new components. 

" UCSB personnel will trave I to Nal rob I to begin staff 
development and training In boTh image processing and geographic 
Information systems technology. We will also provide training 
^ material In these areas, for use at UNEP/GRID Nairobi. This 

training material will Inc I ude both read I ng mater l a I s and samp 1 e 
daTasets. The sample datasets will be chosen after discussions 
with our collaborators at NSTL and UNEP/GRID, and can form a 
basis for future staff development and training courses. 


6 
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Based on these meetings, as wel I as discussions with our 
otner col i aborators and technical mom Tors, we wi i i begin 
preliminary analysis of the problems and prospects Involved In 
efficiently linking these different organizations. This analysis 
will result In a report, detailing the analysis and the suggested 
aiTernatives for efficiently moving daTa beTween the laboratories 
at NSTL, Geneva, Nairobi, and Santa Barbara. 


DELIVERABLES 


UCSB will purchase an appropriate computer system, based on 
requirements and constraints Identified by ourselves and 
col i aborators at NSTL, NASA Headquarters, and UNEP/GRID. 

Shipment and Installation of this system will be provided by 
NASA’s Earth Resources Laboratory, NSTL. Training In the 
operation and applications of this system will be undertaken 
Jointly by NASA and UCSB staff. 


UCSB will provide a final report of our act l v itles w itn 
respect to this grant. This wll I detail our meetings during the 
project, our analysis of the utl l iTy of the computer In Nairobi, 
and tentative plans for future col laboratlon and support 
activities for the effective dissemination of NASA-developed 
technology to the developing countries now being assisted by the 
UNtP/GRID program. 


We look forward to the opportunltes which this grant can 
provide. If funding becomes available. 



Prof. John E. Estes 
Department of Geography 
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This appendix Is the text from a proposal to NASA 
Headquarters, which was submitted during this contract year. 

The first year of this proposal has been funded, and we have just 
started what we hope to be a three to tour year ettort. Our work 
under this new effort for NASA interfaces well with our Office of 
UniversiTy Affairs grant. The Browse program Is designed to 
provide us with experience in problems which will become more 
acuTe m me timeframe of the EOS platform. Under tne Browse 
program, we will develop a testbed. In which we will examine some 
of the problems of geographical iy distributed archives, and tne 
problem of determining the scientific value of remotely sensed 
data located at a remote archive. 
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ABSTRACT 

The Information Science Research Group, University of California, 
Santa Barbara, proposes to supplement their existing Natlona.l 
Aeronautics and Space Administration grant, number NASA NAGw-4b!>, to 
conduct basic and applied research on browse In the EOS era. The 
newest ana most Important initiatives In tne U.S. civilian space 
program revolve around the Space Station Complex. From the perspective 
of scientists studying the dynamic coupling of tne I itnosphere, 
biosphere, hydrosphere, and atmosphere, the most important component of 
tne Space Station Complex Is the Earth Observing System (EOS). 

Data volumes projected for EOS are extremely high. The high data 
volumes, coupled with the geographical ly distributed nature ot tne EOS 
user community, calls for Innovative research on access to EOS data. 

It Is most Important to be able to browse EOS datasets and select only 
that portion of the data relevant to a given research problem. 

This unsolicited proposal develops a tnree-year phased research 
program. Our program Is aimed at the development of a robust, well- 
structured system for browse, able to support users of different levels 
of sophistication. 

There are two primary functions Imbedded in the testbed we propose 
to develop: locating relevant data, and viewing the data. We will 
emphasize the latter In our proposed efforts. Locating data Involves 
four general types of Information about the data: spatial location. 
Information about the data object, themes of coverage, and Information 
about access to the data Itself. We w I I I rely In part on ongoing 
efforts in this general area, including those at NSSDC and tne NASA 
Pilot Data Systems. The second function, viewing the data, includes at 
least tnree levels of access: viewing the data itselT, viewing 
attributes of the data, and viewing information derived from the data. 

During the first year of this proposed effort, we will nold 
meetings of selected scientific users, conduct function tests and 
system development, and we wll I begin col laboratlons wltn other data 
archive sites. Results of these activities will be documented In 
progress and annual/final reports, and In the reviewed literature. 

In conclusion, the concept of EOS as an information system Is an 
important philosophical direction for the program. EOS will 
significantly advance our understanding of our global system, as data 
Is provided to scientists in many disciplines around tne world. Browse 
of datasets is a key element in facilitating scientific access to EOS 
data . 
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INTRODUCTION 


This documenT represents an unsolicited proposal for a supplement 
to National Aeronautics and Space Administration (NASA) Grant Number 
NASA NAGW-455 to conduct basic and appi fed research on key Issues 
related to browse of EOS and related and I I ary data by 
InTerdlscI p I I nary science users. In May, 1983, the oftlce of 
University Affairs of the National Aeronautics and Space Administration 
(NASA) signed a granT esTablishlng a RemoTe Sensing Information Science 
Research Group at the University of California, Santa Barbara (UCSB). 
Research conducted under tms granT continues today and Is aimed aT 
Improving the type, quantity and quality of information which can be 
derived from remoTeiy sensed data. During tne pasT year, the basic and 
applied research conducted under the auspices of this grant has begun 
to focus on me informanon science research need of the Earm 
Observing System. 


As the world's population expands, there is an ever increasing 
need for systems and techniques capable of acquiring, integrating, 
communicating ano analyzing information concerning tne extent, use of 
and change In major components of the earth's surface. NASA is playing 
a key role In the development of such systems ana capabilities. This 
role requires that forslghted fundamental and applied research be 
directed towards the scientific application of technologies which can 
Improve the base upon which assessments may be made of both the current 
ana changing status of the components of the Biosphere, hydroshere, 

I Ithosphere and atmosphere. 


The proposed Earm Observing System (EOS) can play an important 
role in improving our understanding of the spatial and temporal 
dynamics of major components of our global life support system. EOS, 
as defined by the Science and Mission Requirements Working Group (NASA, 
1S»S4a) consists of: "a suite of instruments in low Earm orbit 
acquiring measurements of the Earth's atmosphere, surface, and 
interior, an information sysTem to support scienTiric research, and a 
vigorous program of scientific research stressing study of global -scale 
processes thaT shape ana influence the Earm as a system". EOS is 
being designed to produce large volumes of mu I tispectra I , multitemporal 
data tor multi-disciplinary investigations. An ImporTanT 
consideration of the EOS program then. Is the development of an 
I nf rastrucTure and mechanisms to speed del ivery of daTa from EOS to tne 
science community. A key aspect of the capability is the development 
of a mechanism to Insure that investigators are noT deluged wim 
unnecessary or redundant data. A mechanism which can aid in minimizing 
the potential of this happening Is to develop ways ana means to permit 
scientists to browse EOS data and select only those data most 
appropriate to their research. 


The unsolicited proposal which fol lows describes a tnree year 
phased research effort to address the question of browse in the EOS 
time rrame. Although a three year effort is proposed, we are at 
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present requesting supplement of our existing NASA grant (NASA NAGW - 
4bb) for the firsT year of this efrorT. Funding for tne second and 
third years will be requested later, and we understand that such 
funding is connngenT upon bom The performance of the research and 
funding availability. 


After This introduction, a background secTion describes Tne need 
for the EOS system and its Importance to global science. This is 
fol lowed by a conceptual framework for tne proposed research on Tne 
question of browse. This section Includes approaches to the study and 
descriptions of our proposed three year-phased research etTorT. A 
conclusion section is next, followed by_ the budget for this effort. 

YiTa ot key project personnel and descripTions of faciMTies and 
equipment at UCSB are also included. An appendix, describing a related 
efrorT, is attached aT The end of this documenT. 

* -V 


It should be emphasized that this research wil I take advantage of 
the already extensive inTerdiscipl Inary conTacts and research on Space 
Station and EOS related activities currently being conducted at UCSB. 
UCSB researchers are working directly in tne area of land, cl ImaTe and 
oceanographic research. In addition, we are cooperating with and/or 
working on research for the U.S. Geological Survey and the Research 
Libraries Group (see appendix) on topics related to improving the 
scfenTitic uti I iTy of spanai data. Finally, tne research efrorT 
proposed herein wll I gain maximum benefit from membership on important 
daTa sysTem act I v iti es committees and working groups on Tne parr of 
project scientists. These committees and working groups include: the 

CommiTTee on DaTa ManagemenT and CompuTaTion; the Space StaTion DaTa 
User Working Group; EOS Data Systems Panel; Pilot Land Data System 
Science Steering Group; Research Library Group Task Force on Geo- 
Informatlon; and the Task Force on Scientific Uses of Space Station 
EarTh Science Panel. Membership and parTlci paTlon in these etrorTs 
will continue to provide insight as the research we propose moves 
forward. 


BACKGROUND 


Earth science and associated technology development have 
progressed to a poinT where the conduct of global science appears 
feasible (Estes and Star, 1986). This scientific and technological 
developmenT has been caused, in parT, by the emergence of problems 
which require a multidisciplinary approach (NASA, 1984a). The need to 
reach an improved undersTandi ng of problems such as as deforesTati on, 
dessert i f i catl on, changes in atmospheric carbon dioxide, and acid 
deposiTion (To name a few) grows more imporTanT wiTn each passing year. 
The understanding of these problems, however, requires new approaches 
to science, which comb i ne mu 1 1 i disci p I I nary teams employing advanced 
technologies which can generate a type, quantity and quality of data 
noT previously available to Tne scienTirlc community. The proposed 
Earth Observing System (EOS) currently being planned for the polar- 
orbiTing platform of the United States Space Station complex offers 
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this potential. 


The newesT and mosT ImporTanT Initiatives In the U.S. civilian 
space progam currently revolve around the Space Station Complex. The 
Space Station complex Includes the core station, Its associated co- and 
polar orbiting platforms and Information systems. This proposed suite 
of platTorms and support systems ofrers a unique potential for 
facl I Itatlng long term scientific Investigations on a truly global 
scale. 


Basical ly, the man-tended systems which are proposed for tne 
various Space Station Complex platforms have the capability of 
providing a wide range of data from botn operational and research 
sensors. , These large volumes of mu 1 1 Ispectra I , mu I tl tempora I data, 
which wi i i be generated by the proposed systems, must be supported by 
efficient and effective Information systems to provide the potential 
for data continuity which has, to a large degree, been lacxing from 
sensor systems ooperatlng on Independent free flying platforms. The 
chai i enge to the remote sensing community Is, In essence, two-fold. 

The first challenge Is to get ready to handle the large volumes of data 
wmch will become available In the 19y() time rrame. The second 
chal lenge Is to bring the science and technology we are developing to 
broader constituency. In the service of global science (Estes and Star, 
1986) . 


From the perspective of scientists studying tne earth's surface 
and atmosphere, the most Important component of the Space Station 
complex Is the Earth Observing System (EOS) (NASA, 1 984a; NASA, 19b4b). 
EOS, based on the current design concept, has both active and passive 
eartn surface Imaging systems as well as atmospheric sounding systems. 
EOS represents an evolutionary step In our capabilities for remote 
sensing of the earth. When implemented, EOS may provide tne eartn, 
ocean, and atmospheric science communities with data to support 
unprecedented integrated Investigations among disciplines and 
scientists from many nations. Unlike the previous generation of 
satellites, designed I arge I y for rel atl ve I y l imited constituencies 
(e.g., Landsat for the land scientist, Seasat for the oceanographic 
community), EOS has the potential to provide an Integrated source of 
Information which recognizes the problems and Issues which Involve the 
dynamic coupling between tne oceans, land surface and tne atmosphere. 


In the same way that EOS represents an evolution in earth viewing 
satel i ite technology, we bel leve that tne scientific objectives which 
EOS may help to advance can produce an evolutionary Improvement In our 
understanding of our planet. Traditional branches of tne earth 
sciences have, in the past, generally focused on problems which are 
I imited In scope to modest areas and to relatively narrow ranges or 
biophysical, geochemical and socioeconomic processes. Traditional 
science has been limited by the capabilities of existing technologies 
to measure, map, monitor, and model key elements of these processes. 
EOS datasets will be able to expand our capabilities In tnese areas. 
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and will facl I I+a+e new scientific Insights concerning the wor I d we 
I Ive In. 


EOS has the potential to foster expanded col laboratlon wltnin tne 
scientific community towards Interdisciplinary science on an 
International scale. Yet, It we are to ful ly employ the potential of 
EOS It must be done within an Information systems context, linking 
scientists together wltn both required facilities and each other. Such 
an approach can Improve the global science community's access both to 
data sources and processing capabilities. Global science Is a daia- 
Intenslve activity and In Its broadest sense, EOS, as an Information 
system, can provide a tool for Improved understanding of our planet 
(NASA, 1984a). There are, however, a number of unanswered questions 
concerning the operational and commercial uses of the sensor systems on 
polar platforms. 


Central to these questions Is science access to tne daTa from tne 
EOS systems. Indeed a key recommendation of the EOS Science Mission 
Requirements Working Group is the development, as soon as possible, of 
"a data system that provides easy. Integrated, and complete access to 
past, prestent and future data". Data volumes projected for tne EuS 
system are high. EOS Synthetic Aperature Radar (SAR) and High 
Resolution Imaging Spectrometer (HIRlS) data rates are projected 
between 700 and 800 megabits per second. Daily data rates of one 
terabyte per day have been suggested. To Ii I ustrate tms size more 
dramatically, one terabyte Is a shelf of floppy disks roughly seven and 
one-halt miles long, or a 400 square foot room of 1600 BP I tapes. 


The real chal lenge to the International science community In 
general, and NASA In particular, is the development of tne 
Infrastructure to facilitate the operation of the EOS Information 
System. Scientists must be able to achieve rapid access to needed 
data. The ability to browse data and efficiently and effectively pre- 
judge Its potential utility for a given Investigation Is a critical 
need. The material which fol lows, then, represents an unsol Iclted 
proposal for a three year phased applied research efTort to study 
important Issues related to browsing remotely sensed ancl I I ary data 
sets of Interest to future EOS science users. 


CONCEPTUAL FRAMEWORK. 


The stated goal of the EOS Data and Information system. Is "to 
meet the challenges of EOS mission operations, data transport, 
processing and ata management, In addition to tne chal lenges of access 
to information and data not under direct control of EOS or even NASA" 
(NASA, 1 98b) . This requires tne development of a geographical ly 
distributed information system capable of handling the large volumes 
of data which EOS will produce. 
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It Is anticipated that by the 1990's, local processing 
capabl I i Ties combined with neTwork technologies will a I low such a 
spatial ly distributed system to become a real Ity. A key objective of 
the EOS informanon System Is to provide local and remoTe InTeractlve 
access to a variety of capabilities, data sets, and resources. With 
data loads projected as high as one terabyte per day, the ability to 
browse datasets prior to retrieval must be an Important function for 
the EOS Informanon sysTem. CurrenT esTlmaTes of tne nee.d for such a 
browse capability, of the type we propose to Investigate, ranges 
between 1500 and 15,000 users (NASA 19ti6>. 


Because of the large data volume which can be generated by EOS, 
coupled wlTh user requirements for ancl I I ary data for test slTes around 
the globe, user-friendly human Interfaces to the Information system are 
necessary. Users wll i require a variety of kinds of assistance to be 
able to find potentially relevant data resources, and evaluate their 
potential Importance. Knowledgeable users may have detailed Information 
about data sets, archives, and processing capabilities; these users may 
preTer a terse, streamlined Interface to the system. Less 
knowledgeable and Infrequent users of the system may require more 
detailed on-line user assistance. Including help facilities ana menu- 
driven Interfaces. Such users may require detailed Information about 
daTa set characteristics such as sensor parameters, and Information 
such as spectral, temporal and geographic characteristics of available 
datasets - both for EOS-produced data and Important and I I ary data. 


These requirements Indicate the need for a robust, we I I -structured 
system, able to handle users of different levels of expertise about 
both EOS sensonrs and the data system Itself, as wel I as users from a 
range of disciplines. The ablliTy to browse daTa seTs will be central 
to the needs of both advanced and I ess-soph I stlcated users. 


BROWSE 


There are two functions to examine when considering a sysTem able 
to browse distributed Information sources. First, a future EOS 
Information System must provide facilities to locate daTa wmch Is 
potentially of Interest. This Includes both Identifying archives or 
repositories where the data may be stored, as wel I as selecting tne 
relevant Items from within the archive based on appropriate geographic 
location, sensor characteristics, eTc. Second, we musT be able to view 
either the data Itself, or relevant aspects of the data set. We wii I 
emphasize the latter function In our proposed testbed development. 


Locating Data 


Data archives and repositories throughout tne scientific community 
are beginning to understand how to make data easily available to users. 
One of the first steps is to prepare data d I rector i es . which are 
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essentially lists of the available datasets which are kept In the 
archive. This Is sometimes cal led meTadafa: daTa abouT tne data. Such 
metadata Is often kept In a data base management system (DBMS), to 
permIT easy access to the contents of the database, as we I i as 
manipulation of the directory to find relevant material. 


Important efforts at interfaces to large data col lections are 
beginning through several agencies. Including NSSDC, the NASA Data 
Pilot programs, USGS, EOSAI, SPOl, and the Research Libraries Group. 

In paral lei with these development efforts, we are seeking provate 
foundation funding to automate a directory to tne UCSB Map and Imagery 
Collection - a. col lection of approximately 3.2 mi I 1 ton Image and 
300,000 maps. We propose to use the results of % these new developments, 
col laboratlng with these research and development efforts where our 
staff expertise wll I be helpful. 


There are tour general types of Information we would like to be 
able to consider when looking for a specific element of data. These 
ares 


- I ocator : 

geodetic coordinates that specify the area of Interest. For 
locations on the earth‘s surface, a I ati tude/’ iongitude pair is the 
most universal ly understood coordinate system. Auxiliary 
coordinate systems, such as the Landsat path/row system, should be 
included where appropriate. Minimum useful Information would be 
the center of an object; more useful would be a series of (x,y) 
locations for the ooundary. 

- object: 

properties of the data object itself. This Includes a unique 
standard descriptor (i.e., scene ID, map name), date the data was 
compiled or acquired, and an indication of data quality. 

- themes of coverage: 

for a mul tlspectral sensor, this might include the specifications 
of the spectral bands (i.e, bandwidth and center passband 
wavelength); for a derived product, this might be tne algoritnm, 
or the resulting classification (I.e, transformed vegetation 
Index, or soil moisture). 

- access: 

the location of the data itselt, as we I I as means to acquire or 
view the data. This would also include Information about the 
data format. 

An example of a directory to hold information about remotely-sensed 
data might look I ike the fo I I ow I ng: 
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Scene Path/ Scene Cloud 

Center Row Sensor ID Scale. DaTe Cover Access Format 

35 1 5'N 10/30 TM 840425768 30m 4Jan85 10% EOSAI 1 6UUBP I 

119 10'W BSQ 

This kind of organization of Information about the data holdings 
suggests a relational database structure. In particular, we should 
emphasize that each field In this dataset could be the STart ot a 
search for data. For example, we could begin by specifying the ground 
location of a test site, or by specifying a range of dates, or a 
combination of sensor and data qual Ity. Ful I boolean operations on the 
data r I e jds^must oe possible. 


There are systems that already contain this kind of metadaTa. 
Commercial DBMS's may be a good start towards building a useful 
capability for the scientific community. It Is unreasonable to expect 
that existing data col lections will necessartlly place their data 
eitner within a future Earth Observing Information System (E0IS), or 
obey some not-yet-deterralned standards for Interface to the E0IS. Two 
issues immediately confront a potential user: gaining access to 
distributed directories, and working with these distributed directories 
as I t tney were homogeneous. 


For this proposed work. It Is Important to STay aware ot NASA's 
near-term and medium-term networking plans (l.e., PSCN). We believe 
that access to distributed directories and catalogs can be dealt witn 
at several levels. For scientists with frequent need to access the 
systems, dedicated attachments to NASA's networks may be appropriate. 
For those in the community with less-frequent needs, dial-up and batch 
access facilities may meet their requirements Inexpensively. We 
propose to provide a free telephone number (800 or WATS service) to our 
facilities for our col I aborators In this project, to exercise tms low- 
cost option for providing service. 


Any number of Institutions are working towards on-line query of 
catalogs and data directories (l.e., UCSB Map and Imagery Laboratory, 
NSSDC, E0SAT, UStiS, SP0I). It Is not reasonable to expect that a given 
user understand the details of working with a large number of separate 
data directory systems. For spatial data, we bel leve tnat, while the 
explicit syntax of data base queries w I I I vary between systems, the 
underlying semantics are quite restricted. This presents an 
opportunity for a browse facility to appear to be homogeneous to an end 
user, where In fact the databases are distributed around tne country, 
and there are different database query languages involved. 


Approaches that should be studied include: 

1 - developing a standard for data catalog and directory query 

2 - developing software to convert data caTalog and directory queries 
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to and from a specified set of query languages 

3 - construcnng a central cata I og/d I rectory node on a network, which 

routes queries to appropriate databases and translates between 
query languages 

4 - developing transportable software to run on local processors for 

network routing and query translation 

5 - automated assistance to efficiently decompose complex queries to 

minimize search and network costs 

6 - automated assistance to make the system friendly and efficient to 

a wide range of users 


viewing Data 

Once relevant data sets have been located tnrough tne mechanisms 
above, the user must have some capability to examine the data. Some 
simple forms of examination may be built Into tne directories described 
above. For example, data quality may be one of the fields In the data 
directory. A useful distinction can be made between data kept online 
for some period of time. In comparison to offline materials that would 
require some operator action before viewing Is possible CI.e., mounting 
a tape or optical disk, duplicating a sheet of microfiche}. 


We can structure the viewing of data into tnree levels: 

1 - viewing the data Itself. This may Involve making a portion of 

the data available to the user via on-line methods, or pre- 
computed browse products may be made routinely (such as Landsat 
microform browse Images today). 

2 - viewing data attributes. This may Involve Information in the 

data directory (as In the digital data quality), or possliy In an 
Image header (ephemerls data, for example). 

3 - viewing derived data. This ranges from Information derived from 

the data values (such as mean and variance of specified spectral 
banos), to more sophisticated derived products (such as data 
rectified to some map projection, or a transformation of the data 
according to some model). Useful derived data tor one disci pi ine 
may not be particularly useful to others; the meetings we propose 
In the first year will help us to identify tne needs and desires 
of the NASA and NASA-related science communities In this area. 
While the needs of operational (e.g., NOAA) and commercial (e.g., 
EOSAT, SPOT) users are Important, we feel that focusing on the 
needs of the diverse NASA science community wli I provide an 
appropriate test of the system for this proposed effort. 


Once the user nas identified the desired characteristics of tne 
data to be viewed, there are a number of specific questions that arise. 
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We propose to address several of these during the first year of this 
effort. In conjunction wltn the user's meetings we propose. These 
questions Include: 

1 - are there some set of viewable products tnat might eitner be pre- 

computed and stored, or where dedicated hardware might speed 
their extraction? 

2 - for different disciplines, are there daTa transformations tnat 

might be applied to the viewable data that enhance the data's 
value? 

3 - what level of data compression might be appl led. to tne data, 

balancing data resolution, time, network costs, pre-processing 
costs, and local data reconstruction costs? 

4 - are there relatively standard hardware configurations that the 

browse system must support at various levels of performance? 

5 - are there specified data formats already available for the data 

that must oe passed between systems? 

6 - what levels of services can be provided to users with different 

levels of sophistication and data processing hardware? 


For a first level analysis, we anticipate providing different 
levels of service based on at least 3 different kinds of hardware at a 
user site: 

1 - dumb alphanumeric terminal 

2 - smart alpha terminal with known video attributes and 
addressab 1 e cursor 

3 - graphics workstation 


1 - 'Dumb' terminal query of the database. 

Minimum hardware - almost any dumb terminal, hardcopy or soft. 
Commano-dr l ven SQL-1 Ike Interactive session: 


I FIND (IMAGE) WHERE (LOCATION » 30N 116W) I 
I WHERE (SCALE > 1:24500) I 


2 - Terminal query with smart prompts. 

Minimum hardware - alphanumeric terminal witn cursor addressing 
prompted query language: 
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FIND ( ANYTHING 
( MAP 
( IMAGE 

( DIGITAL DATA 


WHERE (LOCAIION 
( DATE 
( SOURCE 
( DATA QUALITY 
( SCALE 
( THEME 
( FORMAT 


The screen might present a menu of this form, ana tne user can move a 

highlight around the screen to select Items. When an Item Is 

high I ighted, r .a pull-down screen. helps prompt for reasonable answers: 




IND ( ANYTHING 
( MAP ' 

( IMAGE 

( DIGITAL DATA 


WHERE ( LOCAIION 
( DATE 
( SOURCE 
( DATA QUALITY 
==>> ( SCALE <<== 


SCALE OPTIONS: 
1:1 OUU UuU 
1 : 250 000 
1: loo tiuo 

1: 62 500 

1: 24 5u0 

p I ease enter 
your request; 
< 4 > are 
permitted. 


And of course, pull-down menus should be available at any time for 
heip. We emphasize that menus can be a help to a less-experienced 
user, and a burden to an expert. User Interfaces must provide both 
friendly, eng I Ish- 1 anguage menu-level assistance to some users, ana an 
efficient, terse mode of access to others. 


3 - Graphics terminal 


Here, the smart prompting may be combined wltn a base map display 
and Interactive graphics to select geographic regions of Interest. 
Local Intelligence In the graphics terminal can help drive tne system, 
minimizing network traffic and response time. 


The base map display must be capable of zoom/ roam, wim real-time 
reporting of the I atltude/ I ongltude of the cursor, and any selected 
region of interest. The user has the option of highl ighting tne 
effective ground location of any object that is retrieved from the 
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database. 


For example, using the smart menu above, the user could narrow 
the search to sensors with specfled spectral bands, as we I I as a 
specified range of IFOV. Next, the user calls up a base map of the 
U.S., zooms Into Central California, and paints a round area of 
InTerest by specifying the dimension of the region. The system 
derives the appropriate I atltude/ I ongltude window of Interest from the 
displayed graphics, and starts the search. CenTerpoInTS (or 
optionally, outer boundaries) of the retrieved records are plotted on 
tne Dase map display. The user can then high I ight tne graphic 
representation of a particular object, and get a screen display that 
tells about that particular object. 


Options to continue narrowing the search, change tne parameTers 
of Interest, and let search parameters be based on other searches 
(••find me a map that corresponds to tne I asT Image you found for me") 
need to be Included. 


These views of the operation of an InTegraTed browse system are 
only a framework. We believe that we must work In conjunction with 
users In different dlsclpl ines, as wel I as In conjunction wim tnose 
Institutions developing data catalogs (for example, NSSDC and 
NOAA/NESO IS) and directories. In this way, we can adapT our 
development efforts to take advantage of efforts (such as the NASA 
Data Pilot developmenT activities) that are leading towards EUS and 
the EOIS In the 1990's. 


WORK PLAN 


We view our efforts as a part of a three-year period of basic and 
applied research and developmenT. While this propsal discusses our 
concept for the three year phased effort, we have proposed a budget for 
only the firsT year. We understand tnat to I low-on funding Is 
contingent on both our performance and the availability of funding. 


Year 1 provides us wltn necessary user Input, while developing 
tools for the community at large. Year 2 begins the principal 
developmenT eftorTs, as well as consideration of advanced technology 
approaches where appropriate. Year 3 Includes Implementation and 
tesTing of a pilot browse system for a designaTed group of users. We 
outline the three years In brief In the fol lowing paragraphs; our 
budgeT in tnis proposal covers only the tirsT year. We anTlcIpaTe tnat 
the second year will require somewhat higher levels of funding that the 
firsT year, wiTh requesTed expenditures during tne tnird year aT firsT 
year levels. 
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User Meetings^ We propose to convene two meeTlngs of selected 
scientific users (from NASA laboratories as wel I as the University of 
Michigan, University of Wisconsin, and Purdue UntverslTy, among omers) 
and staff from potential data set repositories and archives (JPL/Ocean 
Data System, NSSDC, PLDS, EOlS, NOAA). The tirsT meeting wli I, In 
part, set an agenda for the second, larger meeting. In conjunction with 
represenrati ves, this planning meeting will Identity key Issues and develop 
position papers which will form the basis for discussion at the later 
meeting. The purposes of the larger meeting are to Identity suoseTs 
of the needs of users In different dlsclpl Ines, and Identify the nature 
ana range of services that may be avai lable from agencies wlTn major 
data holdings In the near-term as wel I as In the Space Station Era. We 
also propose to send one senior sTaft member to tne International 
Symposium on Remote Sensing of the Environment In Nairobi. This 
conterence Is sponsored In parT by the United NaTlons Environment 
Program, and an excel lent opportunity to make contact with the 
International community, to acerrain and understand tneir InTeresTs In 
browsing large geographic datasets. 


* Function Test and Development: A variety of generic functions 

are necessary to a browse system. In the first year, we will begin to 
develop and test these functions, using co! leagues born In NASA and 
with NASA-funded research Interests as "clients”. This Involves 
developing a tesT bed configuration, accessible to ouTslde users (via 
NASA networks If available, as wel I as dial-up 800 WATS service). 

Tests involve data compression, daTa display, graphics Interface 
proceedures, and data format and communications standards. 


Co I I aboratlon with Other Sites: In part as a result of tne user 

meeting, we hope to formalize collaboration with other users working 
In similar areas, as well as wlTn STaft In agencies wlTn slgnlTicanT 
data holdings. These collaborations must Include shared software 
development and testing, dataseT exchange, and bench tesTs of 
developed user aids and software components. 


Ill I tl ate Elan nlng for Future Deve I opments : Towards tne end of tne 
first year, we will develop detailed plans for the second year’s 
research efforts, outlined below. In addition, we will begin 
discussions with Headquarters and other NASA personnel on future 
project deliverables. 


Xs.ar I*. 


Eel nc I pa I Software Developments: Software to be able to browse 

data catalogs and directories from remote sites. Software to be able 
to view data stored at remote sites. Software for daTa query language 
conversion . 
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pvp I oration of Advanced Technology; Consideration, and where 
appropriate, development of specifications for advanced technology 
(such as artificial Intelligence techniques) for the test bed. 
Functions that might beneriT from this Include daTa compression, query 
decomposition and translation, distributed problem solving, network 
operation and managemenT, ana data formaT conversion. 


Users Meeting; Based on those col laboratlons begun In the first 
year, monitor the effectiveness of tools and techniques developed in 
the early phases of the program. 


Testbed Evaluation: Provide access to specified dlsmbuTed 

datasets from the testbed, as well as access to the testbed by 
specified users offsite. Evaluate the performance of the system. In 
terms of meeting user requirements In the near-term as we I I as In terms 
of developments In NASA's network facilities and In the EOS sysTem 
Itsel f . 


.Year 


In the final year of this proposed efforT, we w I i I continue tne 
work begun In the first two years, and Integrate advanced technology 
developments Into the testbed. In particular, we will continue our 
examination of NASA Data Pilots and Information systems. We will make 
recommendations to NASA headquarters on Integration of the results or 
our work Into ongoing data pilot activities. Further, In this final 
year we wi I I provide software, and assisT software development for 
specified users and agencies providing data. Final I y, we wil I make our 
testbed system aval table to our col I aborators over a network, for final 
system test and evaluation. 


PROPOSED PROJECT DELIVERABLES 


There are several products to be provided as a part of this 
proposed research efforT. A semiannual progress and annual or final 
report wil I describe the bulk of our efforts. The semiannual progress 
report will Include discussions of work accompl I shed to date, summaries 
of user experiences with the testbed system, and discussions of work 
ana a budget tor the to I lowing year. Also Included will be copies of 
reviewed journal articles coming from this research effort. The final 
report will include discussions of a I I tne work accomplished during tne 
period of the grant. 

Additional ly, copies of software and relevant documentation 
developed under this proposal will be made available to to the sponsor. 
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CONCLUSIONS 


The chai i enge before NASA and the InTernaTional scientific 
community Is to continue to develop both the Infrastructure and 
expertise which wli I Insure tnat the EOS Information System works 
properly. On the one hand, we must continue to develop the science and 
technology of remote sensing. This Includes research on Improved 
communications and advanced processing techniques, to natural language 
Interfaces and advanced scientific workstations, as wel i as new sensor 
technology. The proposed research effort on browse of EOS and 
ancl i lary^jata embodies, to some extent, al l of these. In addition, 
the work to be performed In this proposed research effort provides a 
bridge to the science community. This bridge. If effectively built, 
will provide a pathway which can speed our progress towards global 
science and an improved understanding of the dynamic of our global life 
support systems. This improved understanding is the cornerstone, the 
true goal of the EOS system. 
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